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ABSTRACT 


* 

Coding  under  a fidelity  criteria  of  a class  of  sources  which 
emit  randomly  occurring  messages  Is  Investigated.  This  class  of  sources 
models  Information  carrying  processes  entering  Into  communication  net- 
works. Messages  emitted  by  computer  terminals,  teletypes,  vocoders, 
and  other  such  devices  serve  as  actual  examples.  For  this  class  of 
sources  the  rate  distortion  function  is  derived,  and  source  coding  and 
converse  souce  coding  theorems  are  proven.  Employing  these  theorems, 
an  operational  definition  of  the  rate  distortion  function  In  terms  of 
message  queueing  delay,  and  transmission  delay  is  presented.  This 
definition  relates  the  rate  distortion  function  with  the  message  net- 
work delay  which  Is  an  Important  measure  of  performance  of  a communica- 
tion network. 

Also,  for  such  a class  of  sources,  which  emit  randomly  occurring 
messages,  an  adaptive  data  compression  scheme  Is  Investigated.  This 
scheme  utilizes  observations  of  the  network  congestion  to  determine  the 
amount  of  compression  a message  receives,  with  the  object  of  minimiz- 
ing the  message  delay  for  a given  distortion  level.  Using  results  from 

1 

Markov  decision  theory  for  the  case  of  a nondenumerable  state  space  and 
an  unbounded  cost  function,  the  delay  distortion  relationship  is  examined 
for  a tandem  channel  network  which  utilizes  this  adaptive  data  compression  | 
scheme.  This  relationship  represents  the  trade-off  between  message 
delay  and  fidelity  of  the  reproduction  of  the  source.  Using  a queueing 
analysis  of  the  comomnlcation  system  under  consideration,  numerical  results 
for  the  delay  distortion  relationship  are  provided.  Applications  of 
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adaptive  data  compression  to  analog  signals,  such  as  voice  or  telemetry 


waveforms  transmitted  through  a radio  or  satellite  channel,  are  cited 
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CHAPTER  I 


INTRODUCTION 


Results  In  Information  theory  which  pertain  to  the  coding  of 
sources  under  a fidelity  criteria  are  normally  applied  to  sources 
which  emit  signal  functions  on  a regular  temporal  basis.  When 
considering  an  informat ion-theoretic  approach  to  the  coding  of  sources 
found  in  communication  network  applications,  the  existing  source 
models  are  inadequate.  In  this  dissertation,  a source  model  is 
considered  which  more  accurately  models  the  information  processes 
conveyed  by  communication  networks.  For  this  source,  the  corresponding 
rate  distortion  function  is  shown  to  have  an  operational  meaning  in 
terms  of  message  queueing  delays  and  transmission  delays,  and  an 
adaptive  data  compression  scheme  is  developed  and  optimized. 

1.1  Problem  Statement 

A model  of  a communication  system  for  the  conveyance  of  informa- 
tion from  a set  of  sources  to  a set  of  users  is  depicted  in  Figure  1.1. 
This  block  diagram  represents  the  situation  found  in  communication 
network  applications  such  as  computer  and  satellite  communication 
networks.  Information  is  generated  at  the  sources  by  a certain 
random  mechanism.  At  the  source  encoder,  the  messages  are  processed 
by  a data  compressor  which  generates  approximations  to  the 
messages.  The  channel  encoder  adds  redundant  information  to  the 
latter  message  approximations  to  insure  reliable  transmission  through 
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the  communication  network.  The  communication  network  assu'es  the 


responsibility  of  transmitting  the  messages  from  the  source  to  the 
destination.  The  network  consists  of  an  Interconnection  of  communi- 
cation channels  and  buffers.  The  buffers  are  used  to  regulate  the 
bit  rate  through  the  channels  to  insure  acceptable  message  queueing 
delays.  Finally,  the  channel  decoder  and  source  decoder  decipher 
the  Information  transmitted  through  the  network  and  provide  a repro- 
duction of  the  original  message  to  the  users. 

The  sources  are  assumed  to  generate  the  messages  at  random 
instants  in  time.  Examples  of  such  sources  are  timesharing  terminals, 
stock  quotation  terminals,  vocoders,  and  others.  This  randomness 
in  timing  is  the  essential  element  that  differentiates  this  source 
from  the  normal  information  theoretical  model  of  sources.  Thus,  the 
source  encodei;  as  well  as  other  elements  of  the  communication  system, 
must  be  able  to  accommodate  the  resulting  temporal  randomness.  This 
gives  rise  to  associated  queueing  delays  In  the  communication  system, 
which  along  with  transmission  delays  account  for  the  source  to  user 
message  delays. 

The  intent  of  the  described  communication  system  is  to  deliver 
the  messages  from  the  sources  to  the  users  as  rapidly  and  reliably 
as  possible.  One  trade-off,  between  rapid  delivery  of  the  messages 
and  their  accurate  reproduction,  originates  In  the  source  encoder 
and  the  data  compression  scheme.  As  the  amount  of  compressed  informa- 
tion in  a message  is  Increased,  the  resulting  message  delay  Is  reduced. 
This  trade-off  between  delay  and  distortion  is  characterized  by  the  de- 
lay distortion  relationship.  This  relationship  describes  the  minimum 


average  message  delay  as  a function  of  the  average  distortion 
level  attainable  for  a given  system  configuration  and  a given 
class  of  data  compression  schemes. 

A method  of  improving  the  delay  distortion  relationship  is  to 
permit  the  class  of  data  compression  schemes  considered  in  the 
calculation  of  this  relationship  to  Include  schemes  which  have  an 
element  of  adaptivity.  This  allows  for  alteration  of  parameters 
of  the  data  compression  scheme  as  the  congestion  of  the  communication 
network  changes,  and  thereby  reducing  message  delay. 

A parameter  of  interest  associated  with  the  delay  distortion 
relationship  is  the  minimum  distortion  level  which  achieves  a 
prescribed  finite  value  of  the  average  message  delay.  This  quantity 
represents  the  optimal  theoretically  attainable  accuracy  of  repro- 
duction of  the  source.  It  is  described  by  the  rate  distortion 
function,  an  information-theoretic  concept.  This  function  yields 
the  idealized  relationship  between  the  average  rate  of  tranmlssion 
of  information  in  the  network,  and  the  associated  average  distortion. 

In  this  dissertation,  a model  of  sources  found  in  communication 
networks  is  presented.  Using  this  source  model,  the  rate  distortion 
function  is  evaluated,  and  given  an  operational  definition  in  terms 
of  message  delay.  This  interpretation  is  a natural  extension  of  the 
usual  information-theoretic  operational  definition.  Incorporating 
this  source  model,  a communication  system  with  an  adaptive  data 
compression  scheme  is  analyzed.  Using  Markov  decision  theory, 
the  structure  of  the  adaptive  compression  scheme  which  yields  the 
optimal  delay  distortion  relationship  is  determined.  For  network 

4 


) 


1 


0 


configurations  consisting  of  a single  channel  and  channels  in 
tandem,  the  delay  distortion  relationship  is  computed  through  a 
queueing  analysis  yielding  the  average  delay  of  a message  in  the 
communication  system. 

1.2  Historical  Background 

The  problem  of  encoding  sources  under  a fidelity  criteria 
forms  a well  studied  area  (rate  distortion  theory)  of  information 
theory.  The  evaluation  of  the  performance  of  the  encoding  schemes 
in  terms  of  the  rate  distortion  function  was  introduced  by  Shannon  [1], 
[2].  This  function  represents  the  optimal  theoretically  attainable 
rate  for  a given  distortion  level.  Results  incorporating  the  rate 
distortion  function,  see  Gallager  [3],  and  Berger  [4],  are  normally 
applied  to  source  models  which  have  temporal  regularity.  A typical 
example  of  such  a source  is  the  discrete  time  source.  This  source 
is  assumed  to  produce  each  unit  of  time  a letter  which  is  a realization 
of  the  underlying  stochastic  process.  As  defined,  the  rate  distortion 
function  is  Independent  of  delays  accrued  due  to  encoding  and  trans- 
mission. An  element  of  coding  delay  was  added  to  the  rate  distortion 
function  by  Krlch  and  Berger  [5].  This  was  done  by  Introducing  a 
fidelity  criteria  which  depends  upon  both  coding  time  delays,  and 
the  accuracy  of  the  reproduction. 

Considering  the  transmission  of  information,  communication 
networks  designed  for  the  transmission  of  information  in  a digital 
format  are  of  great  current  Interest.  The  major  motivation  for  this 
Interest  is  the  application  of  such  networks  to  computer  comunlcations 


(for  example,  see  the  recent  books  and  collections  of  papers  by 
Davies  and  Barber  [6],  Abramson  and  Kuo  [7],  Green  and  Lucky  [8], 
and  Chu  [9]).  In  these  applications,  a network  is  established  to 
allow  subscribers  to  share  resources  efficiently.  A major  operational 
requirement  for  such  networks  is  that  messages  from  subscribers  be 
transmitted  as  reliably  and  as  rapidly  as  possible.  The  message  proces- 
ses from  the  various  subscribers  enter  the  network  at  any  specific  node. 
At  a node,  the  processes  can  be  modelled  as  a single  message  process 
generated  by  a source  which  emits  messages  at  random  instants  of 
time  (see,  for  example  Kleinrock  [10]).  The  analysis  of  the  resulting 
message  queueing  and  transmission  delays  for  such  networks  can  be 
found,  for  example,  in  Kleinrock  [10],  [11]  and  Rubin  [12],  The 
analysis  of  the  accuracy  of  the  reproduction  of  the  messages  in  the 
network  under  a fidelity  criteria  was  introduced  by  Rubin  [13]. 

In  that  paper,  the  delay  distortion  relationship  was  introduced  to 
display  the  relationship  between  the  delay  of  a message  in  the 
network,  and  the  amount  of  distortion  accrued  by  a message  when  data 
compressed.  The  data  compression  schemes  which  were  considered 
in  [13],  only  included  data  compression  schemes  that  do  not  have 
any  element  of  adaptivity  or  decision  making  capability  associated 
with  them. 

V 

The  analysis  of  communication  networks  which  contain  decision 
making  schemes  to  control  the  network  can  frequently  be  performed 
by  first  modelling  the  state  processes  involved  by  Markov  decision 
processes.  Then  the  network  is  considered  to  be  a queueing  system 
with  a controller  and  the  results  of  Markov  decision  theory  are  used 
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to  analyze  the  system  and  optimize  it.  Results  for  Markov  decision 
processes  with  respect  to  an  Infinite  horizon  deal  with  describing 
under  what  conditions  a stationary  decision  policy  achieves  the  minimum 
discounted  cost  or  the  minimum  average  cost.  Studies  of  such  problems 
can  be  found,  for  example,  in  Blackwell  [14],  Howard  [15],  Derman  [16], 
[17],  Ross  [18],  [19],  Lippman  [20],  [21],  Reed  [22],  Harrison  [23], 
and  Awate  [24].  Only  in  the  papers  of  Lippman  [20],  [21],  Reed  [22], 
Harrison  [23]  and  Awate  [24]  are  Markov  decision  processes  with 
unbounded  cost  functions  considered.  Furthermore,  only  Lippman  [21] 
considers  such  processes  with  non-d enumerable  state  spaces  and 
unbounded  cost  functions. 

Applications  of  the  results  of  Markov  decision  theory  to  the 
optimal  control  of  queueing  systems  are  quite  extensive.  The  review 
articles  of  Crabill,  Gross,  and  Magaine  [25],  Sobel  [26],  and 
Stidham  and  Prabhu  [27]  outline  some  of  the  applications  to  queueing 
systems.  In  general,  problems  dealing  with  the  optimal  control  of 
queueing  systems  consider  the  state  of  the  system  to  be  the  queue 
size  or  some  other  related  denumerable  state  space.  Shaw  [28] 
considered  a nondenumerable  state  space,  the  waiting  time,  in 
working  with  the  control  of  a M/M/1  queueing  system  with  variable 
service  rate.  The  technique  used  to  solve  this  latter  problem  does 
not,  however,  rely  on  the  results  of  Markov  decision  theory. 

1.3  Outline  of  Dissertation 

The  main  results  of  this  dissertation  relate  to  the  coding 
under  a fidelity  criteria  of  a class  of  network  sources.  This  class 
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of  sources  are  found  in  communication  network  applications  and  the 
sources  do  not  have  an  inherent  temporal  regularity. 

In  Chapter  II,  the  optimal  theoretically  attainable  performance 
of  a source  encoding  scheme  for  such  a source  is  studied.  The  model 
of  the  sources  is  described  in  Section  2.1,  and  the  appropriate  rate 
distortion  functions  for  this  model  are  derived  in  Section  2.2. 

In  Section  2.3,  a source  coding  and  a converse  source  coding  theorem 
are  proved,  and  in  Section  2.4,  an  information  transmission  theorem 
and  its  converse  are  proved.  These  theorems  establish  a natural 
operational  definition  of  the  rate  distortion  function  in  terms  of 
source  codes  which  achieve  a finite  average  message  delay. 

In  Chapter  III,  the  relationship  between  the  delay  of  a message 
in  a network  and  the  distortion  accrued  from  data  compression  is 
studied.  The  delay  distortion  relationship  is  introduced  in  Section 
3.1.  An  adaptive  data  compression  scheme  is  described  in  Section  3.2. 

In  Section  3.3,  Markov  decision  processes  are  introduced  and  relevant 
results  from  Markov  decision  theory  are  reviewed.  Also,  an  extension 
of  the  results  of  Markov  decision  theory  to  the  case  of  nondenumerable 
state  space  and  unbounded  cost  functions  is  derived  in  this  section. 

In  Section  3.4,  the  results  of  Section  3.3  on  Markov  decision 
processes  are  applied  to  the  study  of  an  adaptive  data  compression  scheme 
for  a network  consisting  of  a single  channel.  The  optimal  structure 
of  the  scheme  which  achieves  the  delay  distortion  relationship  is 
determined. 

Considering  an  adaptive  data  compression  scheme,  the  delay 
distortion  relationship  for  a network  consisting  of  a single  channel 

8 


is  further  studied  in  Chapter  IV.  In  Section  4.1,  the  results  of 
Chapter  III  are  used  to  specify  the  delay  distortion  relationship 
In  terns  of  an  optimization  of  an  appropriate  functional. 

The  major  properties  of  this  relationship  are  examined  in  Section  4.2. 
In  Section  4.3,  the  functional  to  be  optimized  is  described 
in  terms  of  an  integral  equation.  In  Section  4.4,  a more  mathemati- 
cally tractable  method  for  evaluating  the  functional,  utilizing 
results  from  Markov  chain  theory,  is  discussed.  Numerical  results 
for  the  delay  distortion  relationship  are  presented  in  Section  4.5. 

In  Chapter  V,  a tandem  channel  communication  network  is  con- 
sidered and  the  results  of  Chapters  III  and  IV  on  adaptive  data 
compression  schemes  are  extended  to  this  network.  The  tandem  channel 
network  is  an  accurate  model  of  a satellite  communication  channel 
or  a radio  relay  channel.  The  delay  distortion  relationship  for  this 
network  Is  determined  in  Section  5.1.  The  functional  form  of  this 
relationship  is  shewn  to  be  similar  to  the  functional  form  of  the 
delay  distortion  relationship  for  the  single  channel  network  studied 
in  Section  4.1.  In  Section  5.2,  the  major  properties  of  this 
relationship  are  presented.  In  Section  5.3,  numerical  results  for 
the  delay  distortion  relationship  of  the  tandem  channel  network 
are  presented. 

A summary,  conclusions,  and  suggestions  for  future  research 
are  found  in  Chapter  VI. 


CHAPTER  II 


SOURCE  MODEL  AND  RATE  DISTORTION  THEORY 
FOR  COMMUNICATION  NETWORKS 

The  class  of  sources  usually  considered  in  communication  network 
applications  generate  messages  at  random  instants  in  time.  Examples 
of  such  sources  are  tlmeshare  terminals,  stock  quotation  terminals, 
and  vocoders.  In  this  chapter,  a model  of  these  sources  is  developed, 
and  an  information  theoretic  approach  is  applied  to  the  coding  of 
these  sources.  Specifically,  the  Idea  of  message  delay  is  Introduced 
into  the  operational  definition  of  the  rate  distortion  function. 

In  Section  2.1,  the  model  of  the  sources  is  described,  and 
measures  of  fidelity  are  discussed.  The  rate  distortion  function, 
R(D),  for  this  model  is  derived  and  is  shown  to  have  a second 
interpretation  in  Section  2.2.  Source  coding,  and  converse  source 
coding  theorems  are  proved  in  Section  2.3.  These  theorems  estallsh 
a natural  operation  definition  of  R(D)  in  terms  of  source  codes 
which  insure  a finite  message  delay.  Information  transmission 
theorems  are  established  in  Section  2.4. 

2.1  Source  Model 

Consider  the  communication  system  depicted  in  Figure  1.1. 

The  source  model  which  is  considered  represents  sources  that  generate 
messages  or  letter  at  random  instants  of  time.  This  source  model 
Is  representative  of  information  sources  which  generate  information 


i 


I 


•i 
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In  a bursty  manner,  such  as  timeshare  terminals,  stock  quotation 
terminals,  and  vocoders.  Furthermore,  many  times  It  Is  required 
that  scientific  Instruments  make  measurements  of  phenomenons  that  occur 
Irregularly.  Hence,  the  data  collected  from  the  Instruments  can  be 
modelled  as  a source  that  generates  messages  at  random  Instants  In 
time. 

For  such  sources  which  generate  messages  randomly  In  time, 

the  usual  models  of  sources  found  In  Information  theory,  (see 

Gallager  [3]  and  Berger  [A])  are  Inadequate  to  describe  the  sources. 

To  account  for  this  Inadequacy,  a model  for  the  superposition  of 

the  arrival  processes  of  the  messages  from  the  various  sources  to  the 

source  encoder  needs  to  be  specified. 

The  superposition  of  the  arrival  processes  Is  assumed  to  form 

a renewal  point  process,  where  the  lnterarrlval  times  are  Independent 

and  Identically  distributed.  Let  {t^,  n ■ 1,  2,  3,  ...}  describe 

the  message  lnterarrlval  times  where  {t  } is  a sequence  of  non- 

n 

negative  Independent  random  variables  with  a common  distribution 
function  F^(*)  and  a common  expected  value,  X \ given  by 


td  F (t)  < “> 

T 


(2.1) 


Then  the  sequence  of  arrival  times  (t  , n ■ 0,  1,  2,  ...}  of  the 


renewal  process  Is  given  by 


n 

t " I t.  , n ■ 0,  1,  2,  ...  (2.2) 

n J-0  J 


Furthermore,  define  N(T)  as  the  number  of  messages  arriving  In  the 
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interval  [0,  X],  T >_  0.  Thus,  N(T)  is  given  by 


N(T)  - sup{n  ^ 0:  tn  < T}  (2.3) 

Messages  arriving  at  different  instants  of  time  are  drawn  from  an 
abstract  space,  X.  Each  message,  x e X,  is  considered  a letter  and 
the  letters  are  assumed  to  be  independent  identically  distributed 
random  variables  which  are  statistically  independent  of  the  arrival 
process.  The  reproduction  of  x e X is  a letter  y e Y where  Y is  the 
abstract  reproduction  space.  Subsequently,  an  amount  of  distortion  is 
accrued  when  x e X is  reproduced  by  y e Y which  is  represented  by 
p(x,y).  The  function  p(*,*)  is  a distortion  measure  and  p(x,y)is 
the  cost  of  reproducing  x with  y. 

It  is  assumed  that  the  users  are  not  concerned  with  the  exact 
time  of  arrival  of  a message.  This  is  reflected  by  the  distortion  mea- 
sure not  being  a function  of  the  arrival  time  of  the  message 
(occurrence  times  which  need  to  be  transmitted  are  considered  as  part 
of  the  message).  However,  users  are  concerned  with  the  message  delay 
which  is  an  Important  performance  criteria  for  communication  networks. 
Therefore,  the  message  delay  is  considered  as  a separate  performance 
measure.  Instead  of  combining  the  accuracy  of  signal  reproduction  and 
message  delay  in  a single  performance  criteria  as  in  Kritch  and  Berger 
151. 


2. 2 Rate  Distortion  Function 

The  rate  distortion  function  R(D)  for  sources  emitting  randomly 
occurring  messages  as  described  in  Section  2.1  is  now  derived.  For 


these  sources,  the  number  of  messages  arriving  In  a given  time  period 
Is  random.  In  contrast  to  sources  which  emit  messages  at  regular 
Intervals  of  time.  Using  the  properties  of  renewal  point  processes. 

It  Is  shown  that  the  rate  distortion  function  for  sources  emitting 
randomly  occurring  messages  can  be  defined  in  an  alternative  manner  to 
the  definition  used  for  the  rate  distortion  function  for  sources  with 
temporal  regularity. 

The  computation  of  R(D)  requires  that  the  sample  function  of  the 
source  over  the  interval  [0,  T] , be  represent  by 

*T  " [{xn}*  Un}*  N(T)1  (2,4) 

where  the  sequence  of  messages  is  given  by 

(x  } - {x  c X,  n - 0,  1,  2,  ....  N(T) } , (2.5) 

n n 

the  sequence  of  arrival  times  Is  given  by 

(tn>  - {tn,  n - 0,  1,  2,  ....  N(T) } , (2.6) 

and  N(T)  Is  the  number  of  messages  generated  in  the  Interval  [0,  T] . 
Similarly,  the  sample  function  of  the  reproduction  of  the  source  is 
represented  by 

yT  - [<yn>.  (tn},  ft(T)]  (2.7) 

where  the  sequence  of  reproductions  of  the  messages  Is  given  by 

(yn>  ■ (y„  £ Y,  n ■ 0,  1,  2 N(T)}  , (2.8) 

the  sequence  of  reproductions  of  the  arrival  times  is  given  by 
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{t^}  **  (t^,  n ■ 0,  1,  2,  •••! 


N(T)  } 


(2.9) 


and  ft(T)  is  the  number  of  messages  reproduced.  Furthermore,  let  qT 

be  the  conditional  probability  measure  generating  the  underlying  joint 

probability  measure  between  *T  and  y^. 

Using  this  notation,  the  average  mutual  information  between  the 

source  and  the  reproduction  yT  over  the  interval  [0,  T] , with 

respect  to  q_,  is  denoted  by  I,  (x  ; y_) . The  distortion  accrued  when 
T q_  r T 

mT 

the  sample  function  *T  is  reproduced  by  is  denoted  p^j^,  ?T)  • 

Thus,  from  Berger  [4],  the  rate  distortion  function  R(D)  is  defined  by 

R(D)  - lim  inf  t'1  I (S  ; f ) (2.10) 

T-~  ^TeQj,(0)  qT 


where 

(^(D)  - (q  : E-  [pT(S_,  ?T)]  < D} 

qT 


(2.11) 


The  distortion  measure  PT(*,*)  which  is  employed,  is  the  time 

average  of  the  per-message  distortion  measure  p(*,*)»  given  in 

Section  2.1.  So,  averaging  p(x  , y ) over  time,  the  distortion  measure 

n n 

to  be  employed,  which  is  denoted  p*(x^,  y^),  is  given  by 


-1  N$T) 

T f P(*n.Ju) 

n-1 
i N(T) 

t [ i p<xn»yn)  + <N<T>  - 

n“0 


if  N(T)  » N(T) 

if  fi(T)  < H(T) 

(2.12) 


14 


c 


c 


c 


( 


where 

Pm-„  ■ Inf  E(p(x,y)|y)  + e (2.13) 

max  ycY 

and  e > 0.  The  quantity  p is  assumed  to  be  finite,  which  in  general 

max 

is  the  case  for  most  distortion  measures.  The  term  (N(T)  - N(T))p 

max 

in  (2.12)  is  Included  to  insure  that  the  measure  is  honest  in  the  sense 
that  providing  no  information  about  a group  of  messages  is  suitably 
penalized.  This  honesty  is  required  in  the  following  proposition  which 
establishes  an  alternative  expression  for  the  rate  distortion  function 
Rt(D).  The  latter  is  defined  by  incorporating  the  tine  average  per- 
message  fidelity  criteria  (2.12)  in  (2.10)  - (2.11)  as  follows 

R (D)  - lim  inf  T_1  I (*  ; ?_)  (2.14) 

T-  <JtE^(D)  qT  T T 

where 

Qt(D)  - {qT:  E [pjttj,  ?T>1  £D>  (2.15) 

qT 

Before  stating  the  proposition,  the  average  conditional  mutual 
information  must  be  defined.  Let  the  notation 

IqT(t{xn}*  {tn)1;  l{yn}’  {tn>llN(T)> 

represent  the  average  conditional  mutual  information  between  ({xn},{tn}] 

and  [{y  },{t  }]  conditioned  on  N(T)  with  respect  to  the  marginals  gen- 
n n 

erated  by  Other  forms  of  the  average  conditional  mutual  informa' 

tion  which  are  used  later  have  similar  means. 


Proposition  2.1 


The  rate  distortion  function  Rt(D)  defined  by  (2.14)  is  given 
by 


R (D)  **  lim  inf  T_1  I ({x  };  (y  }|n(T)) 
T-*~  gTeQ^(D)  qT  n n 


where 


Q^(D)  - {qT:  < D,  E^[I(N(T)  = ft(T))]  - 1} 

and 


( 1 , if  N(T)  - ft(T) 

I(N(T)  - ft(T))  = < 

( 0 , if  N(T)  * N(T) 


(2.16) 


(2.17) 


(2.18) 


is  the  indicator  function. 


Proof 

Making  repeated  use  of  an  equality  from  information  theory, 
the  mutual  information  function  can  be  expressed  as  the  sum  of 
various  average  conditional  mutual  information  quantities  as  follows, 

IqT(*TIV  " ft<T>>  + [{yn>.  (fn}]|fl(T)) 

+ Tq  (1{xn}’  {tn}1;  yxlN(T))  (2.19) 

Since  mutual  information  measures  are  positive. 


Now  let  9.J.  be  given  by 


9'  - [fy',  n - 0,  1,  2,  ....  N(T)>,  [t\  n - 0,  1,  2,  ....  N(T) } , N(T)1 
in  n 


where  for  n ■ 0,  1,  2,  N(T) 


(2.21) 


if  n > fl(T) 


otherwise 


(2.22) 


It  If  n > fl(T) 

* 

t otherwise 

n 


(2.23) 


The  quantity  e Y is  selected  such  that  E(p(x,yc)|x)  < Pnflx  and 

t£  Is  an  arbitrary  constant.  Since  y^  contains  less  information 

about  the  original  sequences  {x  },  and  {t  } than  does  the  original 

n n 

reproduction  9T>  then  it  Is  clear  that 

yI{xn},  (tn}];  ?t|n(t)) 

> I_  ([{x  },  (t  }];  [{y*},  {t’}] |n(T))  (2.24) 

— q_,  n n n n 

Furthermore,  from  (2.12),  (2.22)  and  the  definition  of  y , it  Is 

c 

clear  that 


S^(pt(*t,9t))  > E{^(pt(2t,^))  . 


(2.25) 


Suppose  c Qt(D)  Is  generated  by  5^  and  9jt  then  from  (2.21)  9^.  is 
measurable  and  from  (2.25),  £^(3^,9^)  ± D.  So,  5^  and  9.J.  generate 
a conditional  probability  measure  JLJ,  e Q^(D).  Hence  letting  ijy  and 


i 


J 


be  as  above. 


I.  ([{x  },  (t  });  [{y*}f  (t’}]|N(T)) 
q n n n n 

- I_, ([{x  },  (t  }];  [{y  };  {t'}]|N(T)) 

qT  n n n n 

- I-,([{x >,  (t  }];  [{y  ) , {t  }]|N(T)) 

qT  n n n n 

where  the  last  equality  Is  a result  of  {y^}  » {yM  w. p.l  and 


(2.26) 


{t  } ■ {t’>  w.p.l  under  the  conditional  probability  measure  fl*. 
n n i 

Thus,  for  every  <JT  e 0^(D)  there  exists  a e Q^,(D)  such  that 
(2.26)  is  valid.  So,  taking  the  infinimum  of  (2.26)  results  in 


inf  I_  ([{x  },  {t  )];  [(y  },  (tM}|N(T)) 

£ <£(D)  ^ n n 


inf  I-,([{X  },  (t  }];  [ty  h {t  }]|N(T)).(2.27) 
q;eQ;(D)  n n n n 

Subsequently  taking  the  infimum  of  (2.20)  and  using  (2.24)  and  (2.28), 
results  in 


R (B)  > 11»  Inf  T'1  I,  <({.}, (t  m[{y  l,{t .H|H(T))  (2.28) 

1 " T-  8t€Q;(D)  <1  " " ” ” 

An  upper  bound  to  is  given  by 

R (D)  < lim  inf  T_1  I-  (*»,;9t)  , (2.29) 

T-»  qTcQ^(D)  «T 

since  Q^(D)  is  a subset  of  QT(D).  From  (2.19),  if  3T  e Q|(D)  then 

" ^("(D^d))  + y«\i>.«tn}i;i{y.>.{‘tn}1l"(D)  (2,30) 

An  upper  bound  to  I (N(T) ; N(T)),  as  shown  by  Rubin  [7],  is  the 
qT 

entropy  of  a geometric  distribution  of  mean  E[N(T)].  Therefore, 
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I (N(T)  ;N(T))  < «,n  T + En^  + EINLT1I)  + E[N(T)]  En(l  + E_1[N(T)])  . 


Since  for  a renewal  point  processes 


(2.31) 


lira  E{^P->  - X 
T-**>  1 


(2.32) 


as  given  by  the  Elementary  Renewal  Theorem  (Smith  [30]),  then 


11m  T-1  I.  (N(T) ; N(T))  * 0 
T-x»  "t 


(2.33) 


Thus,  using  (2.30)  and  (2.33)  In  (2.29)  and  taking  the  limit. 


R (D)  < lim  inf  T_1  I_  ([{x  } ,{t  } ] ; [{y  } ,{t  } ] |N(T))  . 

C " T-  <^(0)  ° n n ° 


(2.34) 


Hence,  from  (2.28),  Rt (D)  Is  upper  and  lower  bounded  by  the  same 
quantity  which  results  in  the  equality 


R. (D)  - lim  inf  T_1  I_  ([{x  ),{ t >];[ (y^ },{£>] | N (T) ) . (2.35) 


T-*x> 


qTe(iJ,(D) 


To  get  the  final  result,  repeated  use  of  an  equality  frcm 
information  theory  la  made  to  get  for  l[T  e Q^,(D) 


I-  ([{x >,{t  >];[{y  },  (t}]|N(T))  - I_  ({x  );{y  }|N(T) 

?T  n n n n qT  n n 

+ I_  ({x  };{t  }|N(T),  (y  })  + I-  ({t  };{y  }|n(T),  (xn)) 
q_  n n n q_  n n n 


+ tn>  |n(t)  , f*nMyn>> 


(2.36) 


Since  the  sequences  (t  } and  (t  } do  not  Influence  the  distortion 

n n 

measure,  for  every  JJT  e Q^(D)  there  clearly  exists  a e Q^(D)  such 
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that  {tn>  and  (t^  are  statistically  Independent  and  statistically 

independent  of  {x  > and  {y  } under  81  and 
n 'n 

IqT({xn};  {VIN(T))  * I0J<{xn}{  {yn}lN(T))  (2.37) 

Then  from  the  Independence  and  (2.36) 

IqT(I{x„>.{trf>];I{yn}.{tn}lN(T))  - I5T^xn>;{yn}lN(T)) 

- iq_.({*n>;{yn>|N(T)) 

- I^a{xn),{tn}];[{yn},{tn}]|N(T)) 

(2.38) 

Substituting  (2.38)  into  (2.35),  taking  the  infimum  over  i|  e Q^(D) , 
and  noting  that  for  every  qT  e Q'(D)  there  exists  a q^,(D)  e Q^,(D) 
that  satisfies  (2.38),  the  final  result  (2.14)  is  achieved. 

Q.E.D 


Let  the  rate  distortion  function,  r(D),  for  a single  letter 
be  defined  by 


r(D)  - inf  I (X;Y) 
q e Q(D)  q 

where 


Q - (q:  E (p(x,y)  < D)  . 

q ~ 


0 < D < D < p 

— — max  max 


(2.39) 


(2.40) 


Then,  the  rate  distortion  function  of  a source  emitting  randomly 
occurring  messages  is  obtained  in  Proposition  2.2  in  terms  of  r(D) 
First  note  that  from  information  theory  r(D)  is  convex  and  that 


r(D)  is  assumed  continuous 


0 


i C 

l C 


I 

M. 


Proposition  2.2 

The  rate  distortion  function  Rt(D)  of  a source  emitting  randomly 
occurring  messages  with  intensity  A (as  described  in  Section  2.1) 
under  a time  averaged  per-message  distortion  measure  is  given  by 

Rt(D)  - A r(D/A)  (2.41) 

where  r(D)  is  given  by  (2.39). 


Proof 


N 


Let  be  the  conditional  probability  measure  generating  the 

underlying  joint  probability  measure  between  3^  and  ?T  conditioned 

on  the  event  N(T)  ■ N occurring.  Furthermore,  let  I jf({x  )»(y  > | H(T) ) 

denote  the  average  mutual  information  between  {x  ) and  (y  } given 

n n 

N(T)  ” N with  respect  to  q^. 

Now  define  for  all  N > 0 


0f<D) 


iqT:  EqNfpT^xn},^yn}^N^  * N1  1 D 8nd 

E^[I(N(T)  - ft(T)  |N(T)  - N]  - l| 


(2.42) 


and 


PT(D)  - j{DN  > 0,  N « 0,1,2,...}:  £(0^)  < d)  (2.43) 

where  E(°N(T))  18  the  expected  value  of  the  random  varleble  D^tj. 

For  any  IJT  c Q,J,(D)  it  is  clear  that  the  sequence 
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<¥*  A* 



3 


°N  “ EqTIpT({xn},{yn})lN(T)  ■ NJ  N - 0,  1,  2,  ...  (2.44) 

satisfies  {Djj}  e V^(D).  Furthermore,  iJT  can  be  decomposed  in  the 
following  manner: 

00 

- l qj  I(N(T)  - N)  (2.45) 

N-0  1 

where  for  all  N >_  0,  q!J  e Q^0>N)  and  satisfy  (2.44). 

In  a similar  manner,  if  (Dn)  e PT(D)  and  q!jj  e Q^DN^  for  a11 
B i 0,  then  JJT  is  given  by  (2.45)  and  JJT  e <$^(D).  Hence,  it  is  easily 
shown  that 


inf 

3X  e Q^(D) 


\<{xn};{yn}lN(T)) 


inf  E{IflN(T)«x„  >;(yn>|N(T))>  (2.46) 

(Djj)  e t>T(D)  qTU;  n n 

qT  E Q?(V*  N-° 


So,  applying  Fatou's  Lemma  (see  Rudln  [31])  to  (2.46),  the  following 
bound  is  obtained. 


inf  I.  «x «>S<y„>l»(T» 
qT  c Q'(D)  n n 


inf  E{ 


inf 


{D„}  e P_(D)  N(T)  N(T) 


<T,<W 


IqH(T)({*n>;{yi.}lN(T))} 


(2.47) 


Now  let  e be  an  arbitrary  positive  number.  Now  for  every  D and  N 

II  w 

there  exists  a q"  c q”(D)  such  that 

WU  );{y  }|N(T»  - inf  I„n({x  };(y„}|N(T))  (2.48) 

qT  n n N _N/n.  qT  n n 2 

qT  e QT(D) 


3 


3 


3 


7 


) 


3 


1 
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and  a sequence  (6  } e P (D)  such  that 
N T 


N(T)  0N(T)  . mT 

qT  e QT  (DN(T)) 


I N(T)({x  };{y  }|N(T))} 


- inf  E{  inf  + e/2 

<V'V» 

(2.49) 

Hence,  using  {6M}  that  satisfies  (2.49)  and  q!!  e q!!(D„)  that  satisfies 
N 1 1 N 

(2.48),  equation  (2.45)  determines  JJT  c Q^,(D)  that  satisfies 


I^({xn};{yn}|N(T)) 

- inf  E{  inf 

(D  } e V (D)  N(T)  N(T) 

NT  ^ eQj  (Dn(t)) 


IqN(T)<t*n>;{yn},N(T))}  + e 


(2.50) 


Since  e is  arbitrary,  taking  the  inflmum  of  (2.50)  over  c Q^(D), 
the  reverse  inequality  of  (2.47)  is  produced,  and 


inf  I-  ({x  } ;{y  } | H (T) ) 
qT  e Q’(D)  qT  n n 


- inf  E{  inf  I Mm({x  } ;{y  } |N(T)) } (2.51) 

(V  c Pt(D>  q»(T)  , p»(T)(v  ’¥(> 

Now  inf  I N({x  };{y  } | N (T) ) is  equal  to  the  Ntl1  extension  of 

N qT  n n 

qT  e Qt(D) 

the  rate  distortion  function,  with  respect  to  a per-letter  distortion 

measure  p(*,*),  of  a source  with  letters  {x  e X,  n - 0,  1,  2,  ...,  N) 

n 

which  are  Independent  and  identically  diatributed.  So,  it  la  clear 


w 


(2.52) 


IoN«x  };{y  >|N(T))  - N r(DT/N) 
w n n 

the  result  from  Proposition  2.1,  and  substituting  (2.51) 
into  (2.16),  it  is  concluded  that 

- lim  inf  E{(N(T)/T)  r(DM,_.T/N(T)) ) (2.53) 

T—  (Dn)  c Pt(D)  N(T) 

is  a convex  function  of  D,  Jensen's  inequality  gives 

E{(N(T)/T)  r(DN(T)  T/H(T))>  > E((N(T)/T)  r(D/(T_1N(T))}  (2.54) 

which  is  valid  for  all  {D  } c P_(D)  and  all  T > 0.  Hence,  from  (2.52) 

N T 

the  sequence  {D„}  given  by 
N 

Dn  -D/E(t"Vt»  . N - 0,  1,  2,  ... 

is  an  element  of  P.p(D),  and  achieves  the  lnflmum  in  (2.51)  for  all 
T ^ 0.  This  implies  that 

R (D)  - lim  E(N(T)/T)  r(D/E(T_1N(T))  (2.55) 

1 T-^» 

Finally,  using  (2.31)  and  noting  the  continuity  of  r(D),  the  required 
result  (2.41)  is  achieved. 

Q.E.D 

In  evaluating  tha  rate  distortion  function  (2.14)  a limit  is 
taken  with  respect  to  the  time  period  T.  In  an  analogous  manner 
a rata  distortion  relationship,  ftt(D),  can  be  defined  by  taking  a 


inf 

qj  e qJ(D) 

Using 
and  (2.52) 

Rt(D) 
Since  r(D) 


limit  with  respect  to  the  number  of  messages.  For  that  purpose. 


define,  in  an  analogous  manner  to  *T  and  yT»  *N  and  by 

*N  " l{xn}N*  {tnV 


(2.56) 


^N  ’ [{ynV  {£nV 


(2.57) 


where  (x  {y  {t  }„  and  {t  )„  are  sequences  representing  the  first 
nN  n N nN  n n 

N messages,  reproductions,  times  of  arrivals  of  messages  and  reproduc- 
tions  of  the  times  of  arrivals,  respectively.  Then,  let  p*(*,*)  be  the 
distortion  measure  given  by 

V ■ 5 T »<«„.  y„)  . (2.58) 

n-0 

which  la  the  analogous  to  the  distortion  measure  given  by  (2.12). 

Now  ft  (D)  is  defined,  with  respect  to  the  distortion  measure  p!l(*,*), 
t N 


ftfD)  - 11m  inf  (X/N)  I (*.;  * ) 

^ M m A /n \ Q..  W W 


(2.59) 


e QN(D)  “n  " " 

where  la  a conditional  probability  measure  and 
N 

N-l 

^ “ {,N:  ^ I(X/N)  ^ p(xn’  yn)J  -D}  * (2,60) 

"n  n-l 

The  following  proposition  asserts  that  taking  the  limit  with 
respect  to  time  (T  •),  according  to  (2.14),  or  taking  the  limit 
with  respect  to  the  number  of  messages  (N  ->  -),  according  to  (2.58), 
the  same  rate  distortion  relationship  is  achieved. 
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Proposition  2.3 

The  rate  distortion  relationship  &t(D)  defined  by  (2.59)  Is 
equivalent  to  the  rate  distortion  function  Rfc(D)  defined  by  (2.14) 
for  sources  ealttlng  randomly  occurring  messages  described  in 
Section  2.1.  Thus,  for  all  D > 0 


ftt(D)  - Rt(D)  . 


(2.61) 


Proof 


Using  a result  similar  to  (2.38),  Rfc(D)  is  given  by 

R (D)  - lia  inf  (X/N)  I ({x}..;  {yl) 

e Qjj(D)  % n N n N 


(2.62) 


Nov  inf  I {y_}M)  i*  given  by  the  extension 

^ t qn(d)  % n N n N 

of  the  rate  distortion  function  of  a source  with  letters  x e X which 

n 

are  independent  and  identically  distributed.  Hence,  in  a manner  similar 
to  (2.52) 


and 


inf  la  ({xn}N;  {ynV  " N r(D/X> 

*N  e Qjj(D)  qN  “ N n N 


(2.63) 


by  Proposition  2.2. 


ftt(D)  - X r(D/X) 
- Rt(D) 
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(2.64) 


Q.E.D 


I 


Another  distortion  measure,  that  Is  of  a similar  form  to 


Is  the  per-message  distortion  measure  given  by 

-1  NlT> 

N(T)  1 l p (x  , y ) , if  N(T)  < ft(T) 

n n 


N(T) 


N(T)_1[  l P(x„,y„)+(N(T)-N(T))p  ],  if  N(T)  > N(T) 

n“l 


(2.65) 


The  rate  distortion  function,  R (D) , under  this  distortion  measure 

m 

is  defined  as 


where 


R (D)  - lim  inf  I-  (3t_;  JL) 
T*-  clTeQ^(D)  qT 


Q“(D)  - (p”^,  ?x)>  1D}  • 


(2.66) 


(2.67) 


In  a similar  manner  to  the  definition  of  ftt(D),  a rate  distortion 
relationship  R (D)  can  be  defined  as 


Rn(D)  - lim  inf  (A/N)  I (St,,;  ? ) 


(2.68) 


<N  « Q“(D) 


where 


N-l 


qJ(D)  - (N_1  l p(x  , y ))  < Dl  . 

N N *N  n-0 

Proposition  2.4  shows  the  equivalence  between  R (D)  and  ft  (D). 


(2.69) 
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Proposition  2.4 


For  sources  emitting  randomly  occurring  messages  which  are 

described  in  Section  2.1,  the  rate  distortion  function,  R (D) , defined 

n 

by  (2.66)  and  the  rate  distortion  relationship,  R (D),  defined  by 

ID 

(2.68)  are  given  by 

R (D)  - R (D)  - X r(D)  (2.70) 

ID  ID 

where  r(D)  is  given  by  (2.39). 

Proof 

First,  ft  (D)  as  defined  in  (2.59)  is  related  to  R (D)  by 

C ID 

ft  (I>)  - R.  (DX)  . (2.71) 

in  c 

So,  from  Proposition  2.2  and  2.3 

ft  (D)  - X r(D)  (2.72) 

ID 

Now  using  the  same  reasoning  which  is  used  to  develop  (2.53), 

R (D)  is  expressed  as 

ID 

R (D)  - lim  inf  E{N(T)/T  r(D„._.)}  (2.73) 

” T—  {DN}eOT(D)  N(T) 

where  P^(D)  is  given  by  (2.43).  To  evaluate  (2.73)  define  the  set 
A T(e),  e > 0,  by 

A^Xe)  - (N:  N is  a positive  integer,  ||-  - x|  < e)  , (2.74) 

and  the  Indicator  function  I,  . .(•)  by 


V><N) 


1 if  N t A^e) 


0 otherwise 


Letting  PT(e)  be  given  by 


PT(e)  - Pr(N(T)  t A^e)) 


the  following  upper  and  lower  bounds  are  generated  when  e is  chosen 

such  that  P_(e)  < D/D  : 

T 7 raa-. 


Inf  E{(N(T)/T)  r(D  .)I  , .(N(T))} 

{Dn>  e t>T(D)  N(T)  Vg) 


1 inf  E{(N(T)/T)  r(DM._.)} 

{Dn>  e 0t(D)  NQT) 


{D„>  e D (D-D  P_(e)) 
N T max  T 


E{(N(T)/T)  r(DN(T))IAT(e)(N(T))} 


+ r<D»a*>  Pr(Ve»  * 

Since  for  N(T)  e A^(e),  It  is  easily  established  that 


(A  - e)  £ N(T)  /T  < (A  + e) 


So,  substituting  (2.78)  (2.77)  and  using  Jensen's  inequality, 

it  is  easily  seen  that 


(A-c)  (1  - P _(e))  r(D)  < inf  E{(N(T)/T)  r(DM,_.)} 

T (Dn)  e PT(D)  N(T) 

< (A+e)  (1  - PT(c))  r(D  - D^  PT(e))  ♦ ^(D^)  Pt(e)  (2.79) 


Using  a result  from  renewal  theory  (see  Smith  [30]) 


lim  N(T)/T  - X w.p.l,  (2.80) 

T'*» 

it  is  easy  to  show  lim  P (e)  - 0 for  all  e > 0.  Hence,  the  limit 
T-*»  1 

with  respect  to  T of  (2.79)  is  given  by 

(X  - e)  r(D)  < R (D)  < (X  + e)  r(D)  . (2.81) 

— in  — 

Since  e is  an  arbitrary  positive  number,  equation  (2.70)  is  obtained. 

Q.E.D 

Thus,  the  limiting  procedure  used  in  the  definition  of  the  rate 

distortion  function  can  be  applied  in  terms  of  time  (lim)  or  number 

T-w° 

of  messages  (lim).  The  coding  theorems  in  Section  2.3  establish  the 
importance  of  this  result.  They  establish  that  source  coding  proce- 
dures can  be  used  to  encode  blocks  composed  of  a fixed  number  of 
messages,  rather  than  encode  blocks  of  fixed  time  durations. 

2.3  Source  Coding  Theorems 

The  source  coding  theorems  established  in  Information  theory 
(see,  for  example,  Gallager  [3],  Berger  [4])  provide  an 
operational  definition  of  the  rate  distortion  function,  R(D).  For 
a wide  class  of  sources,  such  theorems  establish  the  existence  of  a 
block  code,  used  for  encoding  the  source,  whose  rate  is  R(D)  + e. 
Furthermore,  this  code  yields  a distortion  level  less  than  D + e 
where  c is  an  arbitrary  positive  number.  Converse  coding  theorems 
establish  that  no  block  code,  used  for  source  encoding,  exists  that 
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has  a rate  less  than  R(D)  and  yields  a distortion  level  not  larger 
than  D.  When  considering  sources  that  emit  randomly  occurring  mess- 
ages, which  are  described  in  Section  2.1,  the  above  type  of  source 
coding  theorems  for  rate  distortion  functions  Rt(D)  defined  by  (2.14) 

and  R (D)  defined  by  (2.66)  can  be  proved.  These  theorems  do  not 
in 

make  use  of  the  nature  of  the  communication  network  involved.  In 
this  section,  another  operational  definition  of  the  rate  distortion 
function  for  sources  emitting  randomly  occurring  messages  is  shown 
to  exist  by  proving  a different  set  of  source  coding  theorems  which 
use  concepts  relevent  to  the  operation  of  communication  networks. 

First,  consider  the  normal  scheme  of  encoding  blocks  of  duration 
T of  the  source  described  in  Section  2.1  with  a block  code.  For  the 
implementation  of  such  a coding  scheme  define  a set  N(T)  consisting 
of  values  of  the  number  of  arrivals,  N(T),  in  a duration  T such  that 

Pr(N(T)  i N(T))  < e (2.82) 

where  e > 0.  For  each  N(T)  e W(T)  a different  block  code  is  used 
to  encode  the  messages,  and  also,  N(T)  must  be  encoded  in  a distortion 
free  manner.  Then,  a source  coding  theorem  can  be  derived  in  the 
usual  manner  by  letting  the  block  length  T become  arbitrarily  large. 

The  disadvantages  of  such  an  encoding  scheme  are  that  an  extremely 
large  code  book  is  required  which  adds  complexity,  and  overhead 
information  must  be  included  to  encode  N(T).  Therefore,  it  would 
be  advantageous  to  be  able  to  encode  in  a block  a constant  number  of 
messages  N,  and  thus,  reduce  the  size  of  the  code  book  and  remove 
the  requirement  of  encoding  N(T).  Then  the  question  of  how  to  define 
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the  rate  of  such  a message  must  be  answered. 

This  question  is  answered  by  observing  that  in  any  communication 
network  buffers  are  placed  at  various  positions  in  the  network.  The 
purpose  of  the  buffers  is  to  smooth  the  flow  of  data  by  storing 
the  randomly  arriving  data  streams  prior  to  transmission.  Making  use 
of  a buffer  which  follows  the  source  encoding  procedure,  the  encoded 
version  of  the  source  is  assumed  to  be  placed  in  a buffer.  The 
information  stored  in  the  buffer  is  further  assumed  to  be  released 
or  transmitted  from  the  buffer  at  a constant  rate,  r,  when  the  buffer 
is  not  empty.  So,  two  measures  of  the  performance  of  such  a block 
encoding  scheme,  which  encodes  a source  using  a fixed  number  of 
messages,  are  r and  the  average  distortion  level  the  code  yields. 

A third  element  of  the  performance  of  such  a message  block 
encoding  scheme  is  the  average  message  delay,  of  a message  due  to 
source  encoding.  The  quantity  y represents  the  average  time  it  takes 
a message  after  being  encoded  to  leave  the  buffer  which  follows  the 
source  encoder  relative  to  the  time  of  arrival  of  the  message.  The 
three  components  of  y are  the  delay  introduced  in  collecting  a fixed 
block  of  messages,  the  queueing  delay  in  the  buffer,  and  the  time  to 
transmit  the  encoded  message  out  of  the  buffer.  It  is  clear  that  y 
must  be  finite  to  insure  the  realizability  of  the  encoding  scheme. 
Since  without  this  restriction,  the  rate  r could  be  arbitrarily 
small,  and  the  code  would  still  yield  a distortion  level  D,  but 
Information  would  be  accumulating  in  the  buffer  causing  the  limiting 
average  message  delay  to  be  unbounded. 


Using  this  concept  of  delay.  Theorem  2.1  and  Theorem  2.2 

establishes  an  operational  definition  of  the  rate  distortion  function 

R (D)  or  R (D).  This  definition  states  that  R (D)  or  R (D)  is  the 
t in  t m 

smallest  rate  of  transmission  out  of  the  buffer  such  that  the  limiting 
average  message  delay,  y,  Is  finite  and  the  code  used  yields  a dis- 
tortion level  not  exceeding  D. 

Theorem  2.1  (Source  Coding  Theorem  for  Network  Message  Sources) 

Consider  the  network  message  source  described  in  Section  2.1 

with  finite  first  and  second  moments  of  the  interarrival  time.  Let 

p(*,*)  be  a given  single  letter  distortion  measure. 

Let  R (•)  of  equation  (2.66)  denote  the  rate  distortion  function 
m 

of  the  source  with  respect  to  the  average  message  distortion  measure 
(2.65).  Then  given  any  e > 0 and  any  D ^ 0,  there  exists  a (D  + e) 
admissible  message  block  :ode  and  a rate  of  transmission  r <_  R(D)  + c 
which  guarantees  a finite  limiting  average  message  delay. 

Proof 

Consider  (n,  m)  message  block  coding  schemes  which  code  n 
messages  using  a code  containing  2m  code  words.  From  the  source 
coding  theorem  for  sources  from  abstract  alphabets  in  Berger  [4], 
it  is  shown  that  for  any  c > 0,  and  any  D > 0 there  exists  m and  n 
such  that  a (n,  m)  coding  scheme  is  (D  + c)  admissible  and 

m < n[r(D)  + (2.83) 

where  r(D)  is  given  by  (2.39). 
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Using  a (n,  m)  coding  scheme,  the  buffer  which  receives  the 
code  words  behaves  as  the  queue  in  a GI/D/1  queueing  system  whose 
interarrival  times  are  the  corresponding  sum  of  the  arrival  times 
of  n messages,  and  whose  service  times  are  equal  to  m/r.  Fixing 
e > 0,  and  D >_  0,  and  letting 

r - R (D)  + e - A r(D)  + e , (2.84) 

ID 


for  some  message  block  coding  scheme,  (n,  m),  which  is  (D  + e) 
admissible,  the  ratio  p of  the  mean  service  time,  m/r,  to  the  mean 
interarrival  time,  nA**1,  is  bounded  using  (2.83),  and  (2.84)  by 


P 


A/n  < A[r.(Dj±.-E-AJ  < , 
r/m  Ar(D)  + e 


(2.85) 


Since  p < 1,  the  result  of  Kingman  [32]  for  a GI/G/1  queue  can  be 
used  to  bound  the  limiting  average  queueing  delay,  V,  of  a message 
in  the  buffer  by 


W < 


2 2 
no,  + o_ 

T S 


2nA_1(l-p) 


(2.86) 


2 

The  quantity  oT  is  the  variance  of  the  message  interarrival  time 
2 

and  the  quantity  og  is  the  variance  of  the  service  time  which  is 
equal  to  zero  for  the  case  under  consideration  since  the  service 
time  is  deterministic.  This  implies  that  W is  finite  since  the 
first  two  moments  of  the  message  Interarrival  time  are  finite. 
Furthermore,  for  any  (n,  m)  coding  scheme  and  r > 0,  the  transmission 
delay  and  the  delay  associated  with  collecting  n messages  are  finite. 
Consequently,  a message  block  code  can  be  found  whose  limiting  average 


delay  y,  which  is  the  sum  of  the  three  delay  components.  Is  finite. 

The  corresponding  converse  of  the  theorem  is  stated  in 
Theorem  2.2. 

Theorem  2.2  (Converse  Source  Coding  Theorem  for  Network  Message  Sources) 
Under  the  conditions  of  Theorem  2.1,  there  exists  no  D-admlsslble 
message  block  code  with  an  associated  transmission  rate,  r < R (D) 

IB 

which  guarantees  a finite  limiting  average  message  delay. 

Proof 

As  in  the  proof  of  Theorem  2.1,  consider  a message  block  coding 
scheme  which  codes  n messages  using  a code  containing  2m  code  words. 

From  the  converse  to  the  source  coding  theorem  for  sources  from 
abstract  alphabets  in  Berger  [4],  it  is  established  that  there  exists 
no  D-admlsslble  (n,  m)  block  code  encoding  n messages  into  m bits 
with 

m < n r(D)  (2.87) 

Now  assume  that  there  exists  a D-admissible  (n,  m)  code  with 

m > n r(D)  . (2.88) 

The  buffer  that  follows  the  encoder  behaves  as  a queue  in  a Gl/D/1 

■ -1 

queueing  system  with  mean  interarrival  time  nA  and  with  mean  service 
time  m/r.  Letting  / 
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r 


the  ratio  p of  the  mean  service  tine  to  the  mean  Interarrival  time  Is 


bounded  using  (2.88)  and  (2.89)  by 


■ “/£  > * r(D) 

n7l  R (D) 
in 


(2.90) 


Consequently,  substituting  the  value  of  R (D)  from  (2.70),  p > 1 Is 

in 

obtained.  From  Llndley's  theorem,  Lindley  [33],  for  p > 1,  the  limit- 
ing average  queueing  delay  is  infinite.  Therefore,  y Is  infinite. 

Q.E.D 


These  requits  extend  readily  to  the  time  average  per-message 
distortion  measure  and  the  associated  rate  distortion  function  R((D) 
defined  in  (2.41).  Corollary  2.1  establishes  the  source  coding 
theorem  and  Its  converse  for  this  case. 

Corollary  2.1 

If  In  Theorems  2.1  and  2.2  the  rate  distortion  function  Is 
replaced  with  the  rate  distortion  function,  R^(D),  defined  with 
respect  to  the  time  average  per-message  distortion  measure  (2.12), 
then  the  theorems  remain  valid. 

Proof 

Since  Theorems  2.1  and  2.2  are  valid  for  all  values  of  distortion, 
they  must  be  valid  for  all  distortion  values  of  the  form  D/1  where 
D ^ 0.  Now  a D/X-admlsslble  message  block  code  is  a time  average 


D-admissible  message  block  code.  So,  by  Propositions  2.2  and  2.4 


R (D)  - R(D/X)  (2.91) 

C ra 

and  the  result  follows. 

Q.E.D 


From  the  proofs  of  the  theorems  a number  of  observations  can 
be  made.  As  the  excess  distortion  approaches  zero  (e  0),  the  limit 
of  the  average  message  delay  becomes  arbitrarily  large.  Furthermore, 
as  e -*■  0,  the  percentage  of  time  the  buffer  is  empty  becomes  arbitr- 
arily small,  and  idle  periods  in  transmission  occur  infrequently. 

Hence,  information  is  being  transmitted  from  the  buffer  at  a uniform 
rate,  r - R(D)  + e almost  all  the  time. 

2.4  Information  Transmission  Theorems 

The  source  coding  theorem  and  its  converse  stated  in  Section  2.3 
establish  bounds  on  the  performance  of  source  encoders.  These  theorems 
together  with  the  nolsy-channel  coding  theorem,  Gallager  [3]  and  its 
converse,  establish  bounds  on  the  performance  of  a communication 
system.  The  communication  system  to  be  considered  is  depicted  in 
Figure  2.1  where  the  system  is  restricted  to  a single  source-user 
pair  and  the  communication  network  consists  of  a single  discrete 
memory less  channel  with  channel  capacity,  C.  The  information  trans- 
mission theorem  and  its  converse  theorem  which  follows,  specifies 
for  the  communication  system  and  a source  emitting  randomly  occurring 
messages,  the  theoretical  limit  of  the  fidelity  of  the  source 


r 


37 


reproduction  at  the  user 


Theorem  2.3  (Information  Transmission  Theorem  for  Network  Message 
Sources) 

Let  the  single  lettered  distortion  measure  p(*,*)  be  bounded  by 

P.  Then  for  all  e > 0,  for  the  source  of  Theorem  2.1  and  for  any 

discrete  memory less  channel  of  capacity  C,  If  C > R (D)  + e,  there 

m 

exists  a message  block  code  which  can  reproduce  the  source  with 
fidelity  (D  + c)  at  the  output  of  the  channel  and  can  guarantee 
a finite  limiting  average  message  delay. 

Proof 

Assume  the  discrete  memoryless  channel  can  transmit  one  of  a 
finite  number  of  channel  symbols  and  the  rate  of  transmission  Is 
r[channel  symbols /sec ] . Now  consider  the  coding  scheme  which  first 
uses  a message  block  code  to  encode  the  source,  and  then  the  resulting 
code  words  are  channel  encoded  using  a block  code  and  sent  to  a 
buffer  to  await  transmission.  Certainly,  the  result  of  the  two 
coding  operations  is  a message  block  code. 

C To  evaluate  the  performance  of  such  a scheme,  the  source  coding 

theorem  and  the  noisy  channel  coding  theorem  are  used.  Assume  e > 0 
and  D > 0.  Using  Berger's  [4]  result  for  coding  of  sources  with 
abstract  alphabets,  there  exists  a (D  + e/2)-admlsslble  message 
block  code,  (n,  m) , which  encodes  n messages  and  has  2m  code  words 
where 

m < n[r(D)  + e/2X]  - ^ lRn(D)  + |].  (2.92) 


C 


39 


Since  the  sequence  of  arriving  messages  are  Independent  and  Identically 
distributed,  the  sequence  of  code  words  are  independent  and  identically 
distributed.  Thus,  the  nolsy-channel  coding  theorem,  Gallager  [3], 
for  a discrete  memory less  channel  applies.  It  states  that  a block 
code,  (N,  M) , can  be  found  that  encodes  N code  words  in  a distortion 
free  manner  into  M channel  synbols  such  that 


e m*N 

C-2<_M"*  r 


(2.93) 


(block  error  at  the  receiver)  < e/2P 


(2.94) 


when  the  channel  symbols  are  transmitted  at  rate  r.  Therefore,  the 
scheme  utilized  is  (D  + e) -admissible,  since  the  sum  of  the  distortion 
due  to  source  encoding  and  due  to  channel  errors  is  less  than  D + e. 
Now  the  channel  and  buffer  represent  a GI/D/1  queueing  system  where 
a channel  code  word  is  the  commodity  to  be  serviced.  The  inter- 
arrival  time  of  channel  code  words  is  the  sum  of  nN  message  inter- 
arrival  periods.  Thus,  for  this  system  the  service  time  is  Mr  1 (a 
deterministic  value),  the  mean  Interarrival  time  is  nN*-1,  and  the 
ratio  p of  these  quantities  is  bounded  using  (2.93)  by 


r-l  Rb(D)  + e/2 
< C - e/2 


(2.95) 


Since 


R (D)  ♦ c < C , 


(2.96) 


p < 1 is  obtained.  As  in  Theorem  2.1,  cne  condition  that  the  second 
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moment  of  the  message  Interarrival  Is  finite  implies  that  the  limiting 
average  message  delay  is  finite  for  a message  block  code. 

Q.E.D 

Similarly,  the  converse  theorem  states  the  following. 

Theorem  2.4  (Converse  Information  Transmission  Theorem  for  Network 
Message  Sources) 

It  is  impossible  to  reproduce  the  source  of  Theorem  2.2  with 

fidelity  D at  the  receiving  end  of  any  discrete  memoryless  channel 

of  capacity  C < R (D)  using  a message  block  code  which  achieves  a 
■ m 

finite  limiting  average  message  delay. 

Proof 

As  in  Theorem  2.3,  assume  the  discrete  memory less  channel  can 
transmit  one  of  a finite  number  of  channel  symbols  and  the  rate  of 
transmission  is  r[channel  symbols/sec].  Now  consider  a block  code 

N 

which  codes  n messages  using  a code  containing  2 code  words  where 
each  code  word  is  represented  by  M channel  symbols.  From  the  converse 
to  the  source  coding  theorem  for  sources  from  abstract  alphabets 
(Berger  [4]),  for  the  block  code  to  be  D-admlssible,  it  must  satisfy 

N > n r(D)  - £ R (D)  (2.97) 

Now  the  buffer  and  channel  form  a GI/D/1  queueing  system  with  the 
transmission  time  of  a code  word  being  equal  to  the  service  time  of 
the  queueing  system.  For  this  system,  the  service  time  Is  Mr  \ 


J 


the  mean  Interarrival  time  is  nA  \ and  the  ratio  p of  these  quanti- 
ties is  given  by 

-1 


P ” 


Mr 


nA 


-1  * 


(2.98) 


There  are  two  cases  to  be  considered.  First,  the  case  p >_  1, 
which  implies 


Mr-1  > nA-1 


(2.99) 


By  Llndley's  Theorem  (Lindley  [33]),  the  limiting  average  message 
queueing  delay  is  infinite,  which  implies  the  limiting  average  message 
delay  is  infinite.  In  the  second  case,  p < 1 which  implies 


Mr-1  < nA-1 


(2.100) 


Now  the  amount  of  information  transmitted  per  unit  time  is,  R,  given 
by 


R - 


I VD> 


-1 


(2.101) 


and  by  (2.100) 


Mr 


R > R (D) 

IB 


(2.102) 


But  for  R (D)  > C,  R > C and  therefore  from  the  Converse  Information 
in 

Transmission  Theorem  (Berger  [4])  the  block  code  does  not  yield  a 
distortion  level  less  than  D. 

Q.E.D 
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As  In  Section  2.3,  the  results  extend  to  the  time  average 
per-message  distortion  measure  and  the  associated  distortion  function 
defined  in  (2.14).  Corollary  2.2  establishes  the  information  trans- 
mission theorem  and  its  converse  for  this  case. 

Corollary  2.2 

Theorems  2.3  and  2.4  remain  valid,  if  the  rate  distortion  function 
is  replaced  with  the  rate  distortion  function,  Rt(D),  incorporating 
the  time  average  per-message  distortion  measure  (2.12). 

Proof 

Same  technique  as  Corollary  2.1. 


The  rate  distortion  function  Rm(D)  is  given  an  operational  defini- 
tion in  terms  of  a communication  system  by  Theorems  2.3  and  2.4.  The 

C 

theorems  show  that  transmission  of  Information  using  a message  block 
code  with  a fidelity  constraint  D and  with  a finite  average  message 

' delay  constraint  over  a channel  of  capacity  C is  possible  if 

C 

C > R (D)  and  is  Impossible  if  R (D)  > C.  If  R (D)  > C,  then  trans- 
m in  — in  — 

mission  can  be  achieved  if  one  of  the  two  constraints  is  removed. 

( 

2.5  Conclusions 

This  chapter  discussed  the  performance  bounds  of  source  encoding 
under  a fidelity  criteria  for  sources  fotxid  in  communication  net- 
works. The  source  model  of  network  message  sources  was  presented. 
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The  rate  distortion  function  for  this  source  was  calculated  and  was 
shown  to  be  equal  to  another  rate  distortion  relationship.  Source 
coding  and  converse  source  coding  theorems  were  proved.  These 
theorems  provided  an  operational  definition  of  the  rate  distortion 
function  in  terms  of  the  existence  of  message  block  codes  which 
satisfy  the  distortion  criteria,  and  have  a finite  limiting  average 
message  delay  associated  with  them.  This  chapter  concluded  with  a 
discussion  of  the  information  transmission  theorems  for  network 
message  sources. 

3 

1 
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CHAPTER  III 


DELAY  DISTORTION  RELATIONSHIPS  AND  ADAPTIVE  DATA  COMPRESSION 

The  proofs  of  the  information  transmission  theorems  in  Chapter  II 
have  indicated  how  to  construct  coding  theorems  which  approach  the 
performance  predicted  by  the  rate  distortion  function.  These  schemes 
have  an  arbitrary  large  message  delay  associated  with  them.  Large 
message  delays  imply  a structurally  complex  communication  system 
(large  buffer  sizes  required),  and  degraded  service  to  the  subscribers 
to  the  network.  Therefore,  for  the  design  of  actual  networks,  a more 
meaningful  performance  measure  than  the  rate  distortion  function 
is  the  delay-rate-distortion  relationship.  The  latter  gives  the 
minimum  average  message  delay  when  constraints  are  placed  on  both 
the  rate  of  transmission,  and  the  fidelity  of  the  reproduction  of  the 
source.  For  a given  network  configuration  with  a fixed  rate  of 
transmission,  this  relationship  reduces  to  the  delay  distortion 
relationship  which  this  chapter  discusses.  The  network  configuration 
and  the  delay  distortion  relationship,  y(D)*  ate  introduced  in 
Section  3.1.  In  Section  3.2,  an  adaptive  data  compression  scheme  is 
presented.  This  scheme  utilizes  a decision  policy  to  minimize  the 
average  message  delay  for  a given  distortion  level  D,  resulting  in 
the  delay  distortion  relationship  YA(D).  The  techniques  from  Markov 
decision  theory  that  pertain  to  the  analysis  of  the  adaptive  data 
compression  scheme  are  presented  in  Section  3.3.  Finally,  using 
these  techniques  the  structure  of  the  optimum  decision  policy  for  a 


r 


r 
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single  channel  network  is  determined  in  Section  3.4.  This  network 
is  an  adequate  model  of  a satellite  channel  or  a radio  relay  channel. 
The  technique  developed  here  to  determine  the  optimum  policy  can  also 
be  applied  in  the  optimization  of  controls  in  a variety  of  other 
queueing  and  flow  control  problems  found  in  networks. 

3.1  The  Delay  Distortion  Relationship  and  the  System  Model 

In  a communication  network,  messages  which  are  transmitted  to 
distant  users  accrue  time  delays.  As  the  information  contents  of  the 
messages  increases,  the  demands  on  the  network  are  heightened  and  the 
message  delays  are  Increased.  For  many  message  sources,  such  as  analog 
waveforms  (speech  signal,  telemetry  waveform),  to  achieve  an  acceptable 
level  of  message  delay  for  the  transmission  of  a message  through  an 
existing  network,  the  messages  must  be  approximate  prior  to  trans- 
mission. The  messages  are  approximated  by  other  messages,  termed 
compressed  messages,  whose  information  contents  are  smaller  than  the 
information  contents  of  the  original  messages.  Since  the  compressed 
messages  are  approximations  to  the  original  messages,  the  fidelity 
of  the  reproduction  of  the  messages  is  degraded.  However,  at  the 
same  time,  the  message  delay  has  been  reduced.  As  an  example,  consider 
the  transmission  of  voice  or  other  random  analog  waveforms  through  a 
packet-switching  s to re-and- forward  communications  network,  such  as 
a computer  communication  network  described  in  Kleinrock  [10],  or 
through  a TOMA  system  such  as  a satellite  communication  network 
described  in  Abraawon  and  Kuo  [7],  From  information  theory,  the 
entropy  rate  of  such  a source  is  unbounded,  resulting  in  unbounded 
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message  delays  for  any  network  realization  If  the  messages  are 
not  approximated.  Hence,  an  appropriate  method  of  approximating  or 
data  compressing  the  messages  is  required,  to  insure  that  tolerable 
levels  of  message  delay  and  fidelity  are  achieved. 

To  study  the  trade-off  between  delay  and  distortion,  Rubin  [13] 
introduced  the  delay  distortion  relationship,  y(D).  For  a given 
source  model,  network,  and  routing  procedure  associated  with  the 
network,  this  relationship  specifies  the  minimum  message  delay  achieved 
when  choosing  a scheme  among  all  data  compression  schemes  belonging 
to  some  class  of  data  compression  procedures  which  achieve  a fixed 
distortion  level.  In  order  to  define  y(D),  let  T(dc)  be  the  message 
delay  for  the  given  models,  and  for  a data  compression  scheme  dc. 
Furthermore,  let  DC  be  a class  of  data  compression  schemes.  Then  the 
delay  distortion  relationship  is  given  by 

y(D)  - inf  T(dc)  (3.1) 

dceDC(D) 

{dc  c DC:  average  distortion  level  associated  with  dc  £ D}. 

(3.2) 

The  source  model  considered  in  the  ftudy  of  y(D)  is  the  model 
described  in  Section  2.1  which  emits  randomly  occurring  messages. 

The  renewal  process  which  describes  the  message  arrivals  is  assumed 
to  be  a Poisson  process.  Thus,  the  lnterarrlval  distribution  function 
is  given  by 

Ft(t)  - 1 - e"AT  , r > 0 . (3.3) 

47 


where 
DC(D)  - 


I 


where  X 1 is  the  mean  interarrival  time.  This  model  of  the  message 
arrival  process  is  used  extensively  in  the  analysis  of  communication 
networks  and  queueing  systems.  Further  justification  for  the  Poisson 
assumption  are  the  limit  theorems  in  Khinchine  [34].  These  theorems 
provide,  under  certain  regularity  conditions,  that  if  the  arrival 
processes  of  various  sources  are  independent,  then  the  superposition 
of  the  arrival  processes  approaches  (as  the  number  of  processes 
increases  and  the  individual  process  rate  decreases)  a Poisson  process. 

The  fidelity  of  the  reproduction  of  the  source  at  the  user  is 
measured  via  the  average  per-message  distortion  measure  given  by 

-1  N 

D - lim  sup  N Id  (3.4) 

N-*»  n-0  n 

where  dn  is  the  average  distortion  associated  with  the  nth  message. 

So,  d is  given  by 
n 

dn  ’ E[P(V  yn)]  (3*5) 

where  p(*,*)  is  the  per-message  distortion  measure  described  in 

Section  2.1,  x e X is  the  nth  message,  and  y e Y is  the  nth  repro- 
n n 

duct ion. 

The  network  considered  is  an  Isolated  path  in  a communication 
network  which  operates  in  a store-and-forward  manner  depicted  in 
Figure  3.1.  The  path  consists  of  M channels  connected  by  nodes  which 
store  messages  received  in  an  Incoming  channel  until  the  corresponding 
outgoing  channel  is  free  to  retransmit  the  message.  The  messages  are 
then  transmitted  on  a first  received,  first  transmitted  basis.  Each 
channel  or  branch,  b , is  assumed  to  transmit  information  bits  about 

ID 
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the  source  at  a rate  C , where  C la  assumed  to  be  lese  than  the 

m n 

capacity  of  the  m channel.  The  channel  model  Is  assumed  to  Incor- 
porate the  appropriate  channel  and  decoding  procedures  which  allow 

the  channels  to  be  regarded  as  noiseless.  Thus,  C_1[sec/bit]  Is  the 

n 

transmission  delay  in  the  mtfl  channel. 

The  class  of  data  compression  schemes  considered  utilizes  data 
compressors  which  operate  on  each  message  Independently  of  the  other 
messages.  These  data  compressors  are  assumed  to  satisfy  a rate 
distortion  relationship  r(D) [bits/message]  where  D Is  the  average 
distortion  accrued  per-message  after  compression.  Thus,  the  trans- 
mission time  In  the  m^  channel  of  a message  compressed  by  a data 
compressor,  with  an  associated  distortion  level  D,  is 
r(D)  [sec /mess ] . 

3.2  Adaptive  Data  Compression 

For  the  source  and  network  model  described  In  Section  3.1, 
there  are  many  classes  of  data  compression  schemes  that  warrant 
consideration.  These  schemes  vary  from  deterministic  processing 
schemes  to  schemes  which  dynamically  adapt  to  the  sources,  and  the 
state  of  the  network.  The  delay  distortion  relationship,  defined  in 
Section  3.1,  is  a means  by  which  the  relative  performance  of  various 
classes  of  schemes  can  be  evaluated. 

First,  consider  a deterministic  scheme  which  uses  a single  data 
compressor  with  distortion  level  D.  Assise  the  data  compressor  Is 
selected  from  the  class  of  fixed  length  data  compressors  which  yield 


c 


a per-message  rate  distortion  relationship  r(D)  where  r(D)  is 

described  in  Section  3.1.  Then  clearly,  the  message  queueing  delay 

is  equivalent  to  the  queueing  delay  of  a customer  in  a M/D/1  tandem 

queueing  system  with  Poisson  arrivals  of  rate  X,  and  message  service 

times  in  the  mth  channel,  m ■ 1,  2,  ...,  M,  equal  to  r(D)  C~^.  From 

in 

Rubin  [12]  the  limiting  average  message  delay,  T^(D),  associated 
with  this  data  compression  scheme  is  given  by  the  sum  of  the  trans- 
mission delay  and  the  queueing  delay,  and  so. 


T^D)  - 


where 


M , r(D)c"t  . 

I {(D)  C'1  + J4” 

m*l  min 


rrar  > lf  D - W1' 


C . - min  C 

min  n 


»e[l,2 M] 


D-in(X>  " r <C»in/X)  ‘ 


The  delay  distortion  relationship,  Y^O)),  for  such  a scheme 
which  uses  a single  data  compressor  is  given  by  minimising  the  limit- 
ing average  message  delay  over  all  data  compressors  in  the  class  of 
data  compressors  which  yield  a distortion  level  of  D or  less.  So, 
from  (3.1) 


yJ(D)  - inf  T^fl) 


(3.9) 
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) 


> 


Then  assuming  r(D)  to  be  a strictly  decreasing  function  of  D and 
using  (3.6),  the  delay  distortion  relationship  reduces  to 

y"(D)  - Tj  <D) 


“ D ” “.in*1) 


1£  D - D.1»(X) 


(3.10) 


which  Is  easily  evaluated. 

The  scheme  using  a single  data  compressor  can  be  extended  by  the 
Introduction  of  K-l  additional  fixed  length  data  compressors.  So, 
a message  arriving  proceeds  Immediately  to  one  of  the  K data  com- 
pressors. Following  Its  compression,  the  message  enters  a buffer 
and  awaits  transmission  In  the  network.  The  decision  policy  x which 
directs  the  messages  to  the  various  data  compressors  Is  an  element 
of  a wide  class  of  decision  policies  II  which  is  specified  later.  The 
policy  can  be  random  or  deterministic,  stationary  or  time  varying, 
dependent  or  Independent  of  the  past  history.  The  major  restriction 
placed  upon  this  policy  Is  that  It  be  causal.  Thus,  It  can  not  be 
dependent  upon  explicit  knowledge  of  future  events.  Such  a scheme 
Is  depicted  In  Figure  3.2. 

For  such  a scheme  to  operate  It  is  Imperative  that  the  decoder 
and  user  be  Informed  which  data  compressor  was  used  to  operate  on 
a given  message.  Without  this  knowledge,  the  performance  of  all  but 
the  simplest  deterministic  policies  ara  considerably  degraded.  To 
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provide  this  information,  protocol  information  needs  to  be  transmitted 
through  the  network.  The  protocol  information  used  to  Inform  the 
decoder  is  modelled  as  a fixed  bit  length  segment  p which  is 

Iv 

attached  to  the  bit  length  of  the  compressed  messages.  Hence,  the 

total  bit  length  of  a message  following  compression  by  a compressor 

with  associated  distortion  level  D is  r(D)  + p . 

Observe  that  the  limiting  average  message  delay  and  the  average 

distortion  level  for  a scheme  using  K data  compressors  may  not  exist 

for  all  policies  under  consideration.  So,  in  order  to  determine  the 

delay  distortion  relationship,  let  w be  the  waiting  time  of  the 

n,m 

nth  message  in  the  buffer  associated  with  the  mC^  channel,  and  let 
dQ  be  the  average  distortion  level  associated  with  the  data  com- 
pressor that  compresses  the  n^  message.  Then  the  average  delay  as 
a function  of  the  policy  ir  and  the  collection  of  average  distortion 

levels  {D^,  k - 1,  2 K}  associated  with  the  K data  compressors 

is  expressed  by 

N**l  M 

T*J({IX  })  - lim  sup  N"1  l E A l (w„  „ + (r(d„)  + pJc'1]}  (3.11) 

* k n-0  * m-1  n,m  “ K m 

where  all  the  buffers  are  initially  assumed  to  be  empty.  This  expres- 
sion assumes  that  the  total  message  arrives  at  the  Instant  of  arrival 
and  that  no  delay  is  accrued  from  the  compressing  procedure  Itself. 

In  a similar  manner,  the  average  distortion  is  given  by 

r i N_1 

Dv({V>  “ lim  8UP  N l Md„>  * 0-12) 

* k If-  n-0  W n 

Then,  the  delay  distortion  relationship  for  schemes  which  use 
K data  compressors  is  given  by 

54 


J 


l 


: 


> 


* 


) 


Yk<D)  - *nf  T*({D.})  (3.13) 

iren(D;{D  ) v k 

{Dk>0,k-r,2,...,K} 

where 

n(D;  {D^})  - {it  e II:  D*({Dk>)  <_  D}  . 

M 

Taking  the  minimum  of  YR(D)  over  all  possible  values  of  K,  the  delay 

M 

distortion  relationship  Y^(D)  for  adaptive  schemes  is  expressed  by 

y“(D)  - rain  y”(D)  . (3.14) 

* Ke  [ 1 , 2 , . . . ] * 

There  must  be  a finite  K attaining  the  minimum  in  (3.14).  This  is 
due  to  the  protocol  information,  pR,  being  an  Increasing  function  of 
K and  so,  eventually,  as  K increases,  the  message  delay  in  the  network 
will  be  unbounded  due  to  the  excessive  amount  of  protocol  information. 
Thus,  the  optimal  adaptive  scheme  uses  a finite  number  of  data 
compressors. 

M M 

In  order  to  compute  YA(D)»  the  problem  of  finding  YR(D)  must  be 
considered  first.  Once  Y^(D)  is  determined,  Y^D)  is  the  lower 

envelope  of  the  curves  y”(D),  K - 1,  2 To  evaluate  Y^(D) , 

Introduce  the  Lagrange  multiplier  p ^ 0 and  define  the  function 

Lw<{V> by 

- l£((Dk»  - T*({Dk»  + p D*((Dk»  (3.15) 


So,  if  the  infimum  operation  in  (3.13)  can  be  replaced  with 
the  corresponding  minimum  operation  (which  is  shown  later  to  be 
valid),  the  expression  for  Yg(D)  reduces  to 


(3.16) 


y!J(D*)  - { min  min  LK((D.  })}  - uD* 

jD^O,  | wen  * 

Ik-1,2,. ..K) 


where 


D*  - D^((D*})  (3.17) 

and  (D*  >_  0}  and  ir*  e II  minimize  L*((Dk>).  Hence,  (3.16)  and  (3.17) 
describe  parameterically  the  delay  distortion  relationship  as  a 
function  of  the  Lagrange  multiplier  y. 

To  determine  when  (3.16)  is  valid,  it  is  necessary  to  investigate 
LK({Dk)),  given  by 


LK({Dk»  - inf  LjJ({Dk})  , 
nell 

and  ascertain  if  there  exists  a ir*  e n which  satisfies 

LK({Dk»  - LjJ*«Dk»  . 


(3.18) 


(3.19) 


This  Investigation  can  be  achieved  by  corsiderlng  the  following  problem, 
where  the  network  is  restricted  to  a single  channel  with  transmission 
rate  C.  The  tandem  channel  case  is  treated  in  Chapter  V. 

Single  Channel  Adaptive  Data  Compression  Problem  (SCADCP) 

Consider  a data  compression  system,  as  described  in  this  section, 
which  uses  K date  compressors  whose  associated  distortion  levels  are 
Dk,  k - 1,  2,  ...»  K.  The  communication  network  Is  a single  channel 
of  transmission  rate  C.  The  state  of  the  system  is  observed  at  times 
of  arrivals  of  messages.  When  a message  arrives  it  must  be  assigned 
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c 


C 


C 


( 


I 


to  one  of  the  K possible  dsta  compressors  for  compression.  Let  the 

state  of  the  communication  system  at  the  instant  of  the  nth  message 

arrival,  n - 0,  1,  2,  . . . , be  defined  as  x . The  state  x is  defined 

n n 

as  the  vector  whose  components  are  the  waiting  time,  wn,  of  the  nth 
message,  and  the  inter arrival  time,  t^,  between  the  n-1  message  and  the 
n^*1  message  arrival  times.  Thus, 


x ■ (w  , t ) 
n n’  n 


and  the  state  space  is  given  by  X - ([0,  »)  x [0,  »)).  The  decision 

at  the  n1"*1  arrival  is  a e [1,  2,  . ..,  KJ,  where  a indicates  the 

n n 

compressor  that  operates  on  the  nth  message.  The  history  H of  the 

n 

communication  system  up  to  the  nth  arrival  is  given  by 

Hn  " (xo»  ao»  xi»  •••»  xn*  ®n)  • (3.20) 

Clearly,  Hn  is  a sufficient  exhaustive  description  of  the  history  of 
the  communication  system,  since  the  communication  system  can  be  consi- 
dered to  be  a single  server  queueing  system,  and  the  past  history  of 
such  a queueing  system  is  specified  by  the  past  interarrival  times 
and  the  past  service  times.  From  the  recurrence  relationship  for 
the  waiting  time  of  a queueing  system,  w ^ is  given  by 

Vl  * “*(wn  + fca  - W 0)  (3‘21> 

n 

where  the  transmission  time,  t^,  associated  with  a message  compressed 
by  the  kth  data  compressor  is  given  by,  k e [1,  2 K], 

tfc  - <r(Dk)  + PK)C_1  . (3.22) 


C 
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So,  using  (3.19)  a version  of  the  transition  probability  distribution, 
Pr<Wn+l  - W*  Tn+1  - TlHn)*  18  glven  by 

Pr(wn+1  1W*  Tn+1  ^-TIV  “ ^Vl^-  Tn+1  ±TI(V  V’  V 


ft(t) 

ft(t) 


, if  W > 


> w + t 
— n a 


- F (w  +t  -W)  , if  T >w  +t  -V 
t n a_  — n a 


otherwise 


A policy  ir  for  controlling  the  system  is  described  by  a set  of 
functions  xr)  , k - 1,  2,  ...,  K,  n > 0).  The  functions 

q£(*>  are  measurable  with  respect  to  the  sigma  algebra  generated  by 
and  and  the  functions  satisfy  for  k ■ 1,  2,  ....  K 


V - 0 


w.p.l 


l Qk(Hn-l*  xn)  " 1 II * <3*2' 

k-1 

These  functions  represent  the  probabilities  of  choosing  the  various 

data  compressors  given  the  past  history  and  the  present  state.  Thus, 

Qj(Hn_lt  xn)  is  the  probability  of  operating  on  the  ntb  message  with 

the  kth  data  compressor  given  history  H . and  state  x . Denote  by. 

n-i  n 

II  the  set  of  all  such  policies. 

To  measure  the  effectiveness  of  a rule  n e n,  a cost  structure 
is  introduced.  If  the  system  is  in  state  x c X at  the  nth  arrival 
and  action  k c [1,  2,  ...»  K)  is  taken,  a cost  c(x,  a)  is  Incurred. 
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This  cost  Is  given  by 


c(x,a)  - w + t + yD 

fi  8 


where  x ■ (w,  t).  Thus,  the  average  cost  *(x,  x)  under  rule  x e n, 
when  the  system  Is  Initially  in  state  x c X,  Is  expressed  by 


*(x,x)  - 11m  sup  N_1  l En(c(xn,an) |xQ  - x)  . 


*(x)  ■ inf  *(x,  x)  . 
veil 

Clearly  from  (3.11),  (3.12)  and  (3.15) 


L,((D.  })  - *((0,0),  x) 

n K 


where  l ((D,  })  is  computed  assuming  the  system  is  originally  free 

t>  k 


of  messages,  and  then  from  (3.18) 


L*({Dk»  - *((0,0))  . 


The  problem  Is  to  determine  when  a policy  x*  e n exists  such  that 
for  all  x e X 


*(x)  ■ *(x,  x*) 


The  existence  of  such  s policy  is  established  in  Section  3.4  by 

noting  that  the  process  ((x  . a ),  n ■ 0,  1,  2,  ...}  is  a Harkov 

n n 

decision  process  as  described  in  Section  3.3.  From  (3.29)  and 
(3.30)  the  determination  of  a x*  that  satisfies  (3.31)  taplles, 
the  existence  of  a x*  which  satisfies  (3.19).  The  structure  of  the 


f 


policy  ir*  that  satisfies  (3.19)  is  determined  in  Section  3.4. 

3.3  Markov  Decision  Processes 

In  this  section  Markov  decision  processes  are  studied  with  the 

emphasis  of  solving  the  average  cost  criteria  problem  for  the  case 

of  unbounded  cost  functions  and  nondenumerable  state  spaces.  A 

Markov  decision  process,  (see,  for  example  Ross  [19])  Is  specified 

by  four  quantities:  A state  space,  an  action  space,  a law  of  motion, 

and  a reward  or  cost  function.  Assume  that  the  state  space  X Is  a 

complete  separable  metric  space  and  that  the  action  space  is  a finite 

set  A.  At  time  n ■ 0,  1,  2,  ...,  the  state  of  the  process  under 

consideration  is  x c X and  action  a c A is  selected.  Then  the 
n n 

history  of  the  process  is  denoted  by  the  sequence  of  states 
and  actions, 

“ {*g»  ®q»  •••»  *^1  an)  • (3.32) 

To  specify  the  law  of  motion  let  B be  the  o-algebra  of  Borel 
subsets  of  X.  Then  assume  that  for  every  x c X and  a c A,  there  is 
a known  probability  measure  Pr(*|x,a)  on  B such  that,  for  some 
version  of  the  measure, 

Pr(xn+1  e Bl*n  “ **  *n  “ *»  Ha_i>  “ **<■!*,  a)  (3.33) 

for  every  B e B and  all  histories  Hn_^. 

The  actions  are  selected  according  to  some  conditional 


probability  distribution 


0.34) 


Pr(a  “ a|H  .,  x ) - Q"(H  .,  x ) 
n 1 n-1  n a n-1  n 


The  functions  Q°(H  , , x ) are  measurable  with  respect  to  the 
a n-1  n 

o-algebra  generated  by  the  random  variables  Hn_^  and  x^.  These 
functions  satisfy 

Qn(H  x ) > 0 w.p.l  (3.35) 

a n— l n — 

and 


T Qn(H  -,  x ) ■ 1 w.p.l  . (3.36) 

. a n— l n 

acA 

Denoted  by  II  the  set  of  all  such  policies  which  generate  actions 
in  this  manner.  A policy  ir  e n is  said  to  be  stationary  if  for 
all  a e A and  n >_  0 

Q"(H  .,  x ) - Q(x)  w.p.l  (3.37) 

a n-i  n an 

and  stationary  deterministic  if  Q (x  ) equals  0 or  1 w.p.l  for  all 

a n 

a c A and  n ^ 0. 

The  reward  function  is  specified  by  the  cost  function  c(x,a) 

which  is  the  cost  incurred  when  the  process  is  observed  in  state 

x e X and  action  a e A is  taken.  The  cost  function  is  assumed  to 

be  finite,  but  not  necessarily  bounded. 

The  process  { (x  , a ),  n - 0,  1,  2,  ...}  is  termed  a Markov 
n n 

decision  process.  Two  possible  measures  of  performance  of  a policy 
n e n that  governs  the  process  are  the  expected  total  a-dlscounted 
cost,  Va(x,x),  given  for  a e (0,1)  by 
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N-l 

V (x,x)  » lim  sup  l o"  E (c(x  ,a  ) |x_  - x} 
° N+~  n-0  * n n 0 

and  the  average  cost,  *(x,x),  given  by 

-l  N_1 

*(x,x)  - lim  sup  N J E { c (x  ,a  )|xn  - x} 

n-0  * n n 0 


(3.38) 


(3.39) 


Minimizing  over  x e II  in  (3.38)  and  (3.39),  let 


Va(x)  - inf  Va(x,x)  (3.40) 

well 

and 

♦ (x)  - inf  *(x,tt)  . (3.41) 

well 

Results  pertaining  to  Markov  decision  processes  deal  with 
establishing  when  a stationary  deterministic  policy  x*  e II  minimizes 
VQ(x,x)  or  minimizes  t(x,x).  The  development  that  follows  discusses 
the  results  for  the  average  cost  criteria  for  processes  with  unbound 
cost  functions  and  nondenumerable  state  spaces.  The  results  are  an 
extension  of  Lippman's  [21]  results. 

To  determine  when  a stationary  deterministic  optimal  rule 
exists  for  the  average  cost  criteria,  the  usual  technique  is  to  find 
the  limit  of  the  policies  which  solve  the  a-dlscounted  cost  problem 
as  a approaches  1.  Then  ascertain  if  this  limit  is  the  optimal 
stationary  deterministic  policy  for  the  average  cojt  problem. 

First,  an  assumption  needs  to  be  made  to  Insure  that  Vq(x,x)  exists 
and  is  finite  for  each  a c (0,1),  x c X,  and  x c n. 
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3 


3 


1 


) 


•> 


3 


— mMMHK  ,1, 


Assumption  3.1 

There  exists  an  Integer  m >_  1,  a real  valued  function  g(*) 
on  X with  g(x)  >_  1 for  all  x e X and  a real  number  b 0 such  that 


L i sup  { | max  c(x,a)|g(x)  } < « , 

xcX  acA 

and  for  all  x e X and  for  n - 1,  2,  ... , m. 


(3.42) 


acA  'X 


[ g(x)n  P(dx| x,a)  <_  (g(x)  + b)n 
J Y 


(3.43) 


Using  Assumption  3.1,  Lippman  [21]  proves  a theorem  which 
guarantees  the  existence  of  an  optimal  stationary  deterministic 
policy  for  the  a-dlscounted  cost  problem.  The  relevent  results  of 
this  theorem  that  are  required  for  the  development  of  the  solution 
to  the  average  cost  problem  are  stated  in  the  following  theorem. 


Theorem  3.1 

For  a Markov  decision  process,  suppose  Assumption  3.1  holds, 
then  an  optimal  stationary  deterministic  policy  wj  exists  for  the 
a-discounted  cost  problem,  a e (0,1),  and  Vq(x,xJ)  is  the  unique 
solution  to 

V (x.^J)  ■ »in  (c(x,a)  + a [ V (x,x*)  Pr(dx|x,a)>  , (3.44) 

acA  JX 

the  functional  equation  of  dynamic  programming.  In  addition,  x*  is  the 
policy  which  selects  an  action  minimizing  the  right  side  of  (3.44) 
for  each  x e X. 


Proof 


See  Llppman  [21]  for  proof. 

Q.E.D 

A consequence  of  the  proof  of  the  theorem  is  a policy  Improvement 
algorithm  for  finding  Va(x,n*),  which  is  stated  in  a corollary. 

Corollary  3.1 

Suppose  Assumption  3.1  is  valid,  and  let  a e (0,1).  Let 

Uft  (x)  “ 0 for  all  x e X and 
u,a 

U . (x)  - min  (c(x,a)  + a [ U (x)  Pr(dx|x,a)>  , (3.45) 

n+1’°  aeA  'x 

then 

V (x,x*)  - lim  U ( x ) (3.46) 

a a n»a 

Proof 

A consequence  of  the  proof  of  Theorem  1 in  Lippman  [21]. 

Q.E.D 

Before  proceeding  to  the  average  cost  problem,  a lemma  is 
presented  which  is  needed  to  show  the  relationship  between  the  limit 
of  the  a-dlscounted  cost  and  the  average  cost  for  a policy  if  e II. 

This  lemma  is  an  extension  of  an  Abelian  theorem  in  Wldder  [35]. 


Lenina  3.1 


Suppose  Assumption  3.1  is  valid,  then 


$(x,w)  >_  lim  sup  (1  - a)  V (x,ir) 
a+1  a 


for  all  x e X and  n e II. 


Proof 


Let  a e (0,1),  and  let 

N-l 


W (x,ff)  - lim  sup  E { l a l c(x  ,a  )|x  ■ 

n-0  j-0  J J 0 


x} 


N-l  n N N-l 

lim  sup  E { [ l — c(x  ,a  ) - y-—  £ c(x  ,a 
* a l-a  n n 1-a  n i 


n-0 


n~0 


for  all  x e X,  and  n e n.  But  from  Assumption  3.1  for  all  x 
a c A 


c(x,a)  <_  L g(x)m 

and  from  (3.43)  for  all  i c n,  n > 0,  and  x e X,  the  bound 

E1[(c(xn,an)|x0  - x]  £ (g(x)  + (n-l)b)m 

is  obtained.  Thus,  one  obtains 
N N—1 

U^!up  E.  J0  C<V»>  K * »>  • 0 • 

So,  substituting  (3.38)  and  (3.51)  into  (3.47), 


(3.47) 


) ] | x0  - x} 

(3.48) 

X and 

(3.49) 

(3.50) 

(3.51) 


r 


N-l  n 
a 


VX*w)  - lin  sup  E { £ — c(x  ,a  )|x  - x} 

n-0  1 n n 0 


Va(x,n) 

l-o 


Rewriting  (3.52),  for  all  M > 0 


(l-o)  Va(x,ir)  - W a(x,it)  (l-a)‘ 


(3.52) 


' ’ 


" (l-a)2  l a"  l E (c(x  ,a  )|x  - x) 

n-0  j-0  * J J 0 


N-l 


+ lin  sup  (l-o)  I non  ^ E (c(x.,a.)|x  - x}/n 
N-~  n-Hfl  j-0  3 3 0 

(3.53) 

Now  consider  the  bounds  which  state  that  for  all  m - MML,  ....  N-l 
1 a a< 

: E»<c<‘j-*i)|'o  - *'  ■ *>  o.m) 


and 


N-l 

l 


n „ a 


na  < 


2 * 


(3.55) 


n-M+1  (l-a) 

Using  these  bounds  in  (3.53),  results  in 

(i-a)  Vo(x,w)  <_  (l-a)2  l a11  l E {c(x  ,a  )|x  - x) 

n-0  j-0  3 3 0 

■ 

+ li*  sup  sup  o{  l E (c(x.,a  )|xn  - x}/«}  (3.56) 

1H-  N>a>M  j-0  3 3 0 

Taking  the  11a  sup  of  (3.56)  as  a ♦ 1 and  noting  that  for  0 n < M 

n 

l E*(c(V*|>l*o  " x)  < m • (3*57) 

J— 0 J •* 
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it  Is  clear  that 


11m  s 
all 


IQ 

sup  (1-a)  V (x.tt)  < sup  { l E [c(x,,a.)|xft  - x]/m} 
1 ° m>Hfl  j-0  J 3 0 


(3.58) 


Thus,  letting  M become  arbitrarily  large,  and  using  (3.39),  the  result. 


(3.47),  is  achieved. 


Q.E.D 


Further  assumptions  need  to  be  made  to  guarantee  that  some 
stationary  deterministic  policy  is  average  optimal.  These  assumptions 
deal  with  the  convergence  of  the  a-dlscounted  minimal  cost  function  to 
the  minimal  average  cost  function,  and  differ  from  the  assumptions 
made  in  Lippman  [21]. 


Assumption  3.2 

There  exists  a state  x*  e X,  a value  a*  e (0,1),  and  a real 
valued  function  L(x)  on  X such  that  for  all  a e (a*,l)  and  x e X 


lva(x*,n*) 


- iL<*> 


(3.59) 


when  ir*  exists, 
a 


Assumption  3.3 


There  exists  a stationary  policy  w,  and  an  Increasing  sequence 


{a  } with  a II  that  satisfies 
n n 


♦(x,x)  - lim  sup  (1-a  ) V (x,n*  ) 

n a an 


(3.60) 


for  all  x e X. 
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A technique  for  finding  tt  in  Assumption  3.3  is  to  place  a further 

assumption  on  the  problem  to  force  tt  to  be  the  limiting  policy  of 

{w*  }.  For  the  case  of  a continuous  state  space,  the  limit  of  a set  of 
n 

policies  is  a difficult  concept  to  define.  In  verifying  the  assump- 
tions for  the  SCADCP  outlined  in  Section  3.2,  the  limit  of  the  policies 

ifj  for  the  problem  can  be  defined  quite  easily.  Hence,  tt,  which 
n 

is  the  limit  of  the  policies,  can  be  found.  Section  3.4  discusses 
these  results. 

The  following  theorem  proves  that  if  a Markov  decision  process 
satisfies  the  Assumptions  3.1,  3.2  and  3.3,  then  an  average  optimal 
stationary  deterministic  policy  exists. 

Theorem  3.2 

Suppose  Assumptions  3.1,  3.2,  and  3.3  hold,  then  the  average 
optimal  policy  it*  is  a stationary  deterministic  policy  given  by 
the  policy  r in  Assumption  3.3  and  for  all  x e X,  *(x)  in  (3.41) 
is  Independent  of  x. 

Proof 

From  Assumption  3.1  and  Theorem  3.1,  for  all  a e (0,1),  it* 

exists  and  is  a stationary  deterministic  policy.  From  Lemma  3.1 

for  any  increasing  sequence  {a  } with  a ♦ 1, 

n n 

♦ (x,it)  >_  lim  sup  (1-a  ) V (x,w)  (3.61) 

tt+~  n 

for  all  x c X and  all  it  e II.  Since  II*  is  the  a-optimal  discounted 
cost  policy,  then  for  all  a c (0,1) 


t 


c 


c 


t. 


t 


c 


t 


t 


c 


t 


(1-a)  Va(x,n)  >_  (1-a)  Vffl(x,x*) 


Taking  the  11m  sup  with  respect  to  a,  and  using  (3.60)  and  Assumption 


3.2, 


♦(x,x)  >_  11m  sup  (1-u  ) V (x,x*  ) 

n n 


11m  sup  (l-o  ) V (x*,x*  ). 

n o o 

»■*“  n n 

(3.62) 


Finally,  since  the  sequence  {a^}  is  arbitrary,  applying  Assumption 
3.3  to  (3.62)  gives 


#(x,x)  6(x,x)  ■ #(x*,n) 


(3.63) 


for  all  x e II,  and  x e X.  Consequently,  since  (3.63)  holds  for  all 
x c X and  x c II,  then 

♦(x)  - *(x*,x)  (3.64) 

and 

x*  - x . (3.65) 

Q.E.D 


The  procedures  utilized  In  Theorem  3.2  differ  from  the  procedures 
used  In  the  classical  works,  as  In  Derman  [17]  and  Ross  [19].  In  these 
works,  first  a theorem  Is  proved  stating  that  if  the  functional 
equation,  which  is  similar  in  nature  to  (3.44),  for  the  average  cost 
problem  has  a solution,  an  average  optimal  stationary  deterministic 
policy  exists.  Then  to  show  the  existence  of  such  a solution,  the 
limit  of  the  functional  equation  (3.41)  for  the  a-discounted  cost 


--dtr-Vv 
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problem  is  examined.  This  technique  requires  additional  assumptions 
beyond  Assumption  3.1  - 3.3  to  guarantee  the  convergence  of  the 
functional  equations.  These  additional  assumptions  might  not  be 
easily  verified  for  Markov  decision  processes  with  unbounded  cost 
functions  and  nondenumerable  state  spaces. 

3.4  The  Structure  of  Optimal  Policy  for  the  Single  Channel  Adaptive 

Data  Compression  Problem 

The  Single  Channel  Adaptive  Data  Compression  Problem  (SCADCP) 
outlined  in  Section  3.2  deals  with  the  issue  of  optimally  controlling 
the  selection  of  data  compressors.  This  issue  can  be  categorized 
as  a problem  dealing  with  the  optimal  control  of  a queueing  system. 

The  techniques  derived  from  examining  Markov  decision  processes  are 
used  extensive  in  solving  this  class  of  problems.  In  this  section, 
for  the  SCADCP  where  the  adaptive  compression  scheme  uses  two  data 
compressors  (K"2),  the  structure  of  the  optimal  policy  is  determined 
using  the  results  of  Markov  decision  theory. 

In  order  to  apply  the  results  in  Section  3.3  to  the  SCADCP,  it 
must  be  verified  that  a Markov  decision  process  is  being  generated 
by  the  coomninlcation  system  under  consideration.  Clearly,  the  process 
{(*n»*n)»  n«0,  1,  2,  ...}  of  the  SCADCP  is  a Markov  decision  process 
since  the  state  space  X - <[<>,•)  x [0,-)),  action  space  [1,2],  the 
law  of  motion  (3.23)  and  the  cost  function  (3.26)  all  satisfy  the 
requirements  outlined  in  Section  3.3  for  a Markov  decision  process. 
Thus,  the  structure  of  the  optimal  policy  can  be  found  by  using 
Theorem  3.2,  if  Assumptions  3.1,  3.2  and  3.3  can  be  verified. 


t 


Before  proceeding  with  the  verification  of  the  assumptions, 
assume  without  loss  of  generality  for  the  remainder  of  the  chapter 
that  the  distortion  levels  associated  with  the  two  data  compressors 
are  ordered  in  the  following  manner: 

Dx  < D2  . (3.66) 

Furthermore,  consider  the  two  regions. 

Region  1:  D2  <_  r_1(y  “ P2) 

Region  2:  D2  > r_1(^  “ P2>  • 

For  region  1,  for  every  it  e n,  4>(x,x)  is  unbounded  as  shown  in  the 
following  lemma. 


Lemma  3.2 


°2  - ?"1(?  " P2)  • 0.67) 

then  #(x,x)  Is  unbounded  for  every  ir  e n. 

I 0 

Proof 

From  (3.22)  and  (3.67),  for  k ■ 1,  2 

tk  - (r(Dk)  + p2)C_1  > X'1  (3.68) 


and  from  (3.66), 


r 


Expression  (3.21)  for  the  waiting  tine  gives 


Vl  - "“«>•  + ta  * Vl> 


(3.70) 


Thus,  the  policy  i which  minimizes  wr  for  all  n > 0 is  the  policy 
that  always  selects  data  compressor  2.  So,  for  all  n > 0 


M”.1*  ■ V i - "o' 


(3.71) 


Under  i,  wn  is  the  waiting  time  of  the  nth  customer  in  a M/D/1  queueing 
system  with  service  time  t^t  and  mean  interarrival  time  X \ From 
(3.68)  the  traffic  Intensity  p is  given  by 


P - t2/x  >_  1 


(3.72) 


and  therefore  by  Llndley's  theorem  (Lindley  [33]),  E- (wr | xQ  - x) 
grows  without  bound  (i.e.,  lim  E (w  |x  - x)  - »).  Now  using  (3.24) 

it*»  n u 

and  (3.25),  *(x,v)  is  bounded  by 


«(x,w)  >_  11m  sup  N-1  l Eff(wn|x0  - x) 
“ — n-0 


N-l 


> lim  sup  N_1  l E«(w  |xn  - x) 


N-l 


n**0 


i n'  0 


(3.73) 


Thus , 4(x,v)  is  unbounded. 


Consequently,  the  region  of  Interest  is 


Q.E.D 


D7  > ~ P,> 


(3.74) 


and  the  following  set  of  lemams  verify  the  assumptions  stated  in 
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Section  3.3 


Lemma  3.3 


Assumption  3.1  is  valid  for  the  SCADCP. 


Proof 


Let 


m ■ 1 


> 


g((w,T))  = W + 1 


and 


b * max  t < 00 
ae[l,2]  a 


Then 


sup  (|  max  c((w,r) ,a) |g((w,t))  } 

we[0,®)  ael,2 
xe[Of“>) 

= sup  { | max  (w  + t + pD  ) | (w  + 1) 
we[0,»>)  ae[l,2] 

te[0,“>) 


< 1 + max  (t  + pD  ) < ® (3.75) 

ae[l,2]  a a 

and  (3.42)  is  verified. 

To  verify  (3.43),  using  (3.23) 
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max  [ g((w,t))  Pr(d(w,r)| (w,r),a) 

ae[l,2]  J 

we[0,°°) 

Te[0,») 


< max  (w  + 1 + t ) Pr(d(w,r) | (w,t) ,a) 

ae[l,2]  a 1 

' we[0,°°) 

te[0,®) 


< g((w,t))  + b 


(3.76) 


Hence,  Assumption  3.1  is  valid. 


Q.E.D 


By  Theorem  3.1,  there  exists  an  a-optimal  stationary  deterministic 
policy  for  all  a e (0,1)  which  can  be  found  using  the  policy  improve- 
ment algorithm  of  Corollary  3.1.  The  following  two  lemmas  establish 
the  structure  of  the  a-optimal  policy. 

Lemma  3.4 

An  a-optimal  policy  exists  for  all  a e (0,1)  and  is  a stationary 
deterministic  policy  that  depends  only  on  the  observed  waiting  time. 
Hence,  the  a-optimal  policy  is  not  a function  of  the  observed  inter- 
arrival time  given  the  observed  waiting  time. 


Proof 

Lemma  3.3  and  Theorem  3.1  imply  that  an  a-optimal  policy  exists 
and  is  a stationary  deterministic  policy.  To  show  that  the  a-optimal 
policy  is  Independent  of  the  observed  Interarrival /:lme,  first  note 

7* 


’.rwr\ 


that  from  (3.26)  c((w,T),a)  can  be  written  c(w,a)  since  it  is  not 
a function  of  x.  Consider  the  policy  Improvement  algorithm  (3.44) 
where  clearly  Uq  a((w,x)),  given  by 


U0,a((w’T))  " 0 


for  all  w e [0,“)  and  t e [0,“),  is  independent  of  x.  Now  suppose 

U ((w,x))  is  only  a function  of  w (i.e.;  U ((w,x))  - U (w) 
n,a  n,a  n,a 

for  all  w e 10,“)  and  x e [0,”)).  Then,  it  is  easy  to  verify  that 
Vi,  a((w,x))  as  given  by 


min  (c(w,a)  + a [ U ((w,x))  Pr(d(w,x) | (w,t) ,a) } 
ri  91  f ) n,o 


ae[l,2] 


we[0,“) 
xe [0,“) 


is  not  a function  of  x,  since  the  conditional  distribution 

Pr(w  . < W,  x < T| (w  ,x  ),a  ) in  (3.23)  is  independent  of  x . 
n+i  — n+i  — n n n n 

Thus,  by  induction  U ((w,x))  is  independent  of  x for  all  n > 0. 

lljCl  ““ 

By  (3.46)  Va((w,x)  ,n*)  is  given  by  the  limit  of  Un  a((w,x))  as  n -*■  “, 
and  hence  Va((w,x) ,n*)  is  Independent  of  x.  Since  the  policy  ir* 
minimizes  the  right  side  (3.44),  and  the  right  side  of  (3.44)  is  not 
a function  of  x,  then  it*  is  a policy  which,  given  the  observed  waiting 
time  value,  is  Independent  of  the  corresponding  observed  lnterrarlval 
time. 


Q.E.D 


Before  proceeding,  the  definition  of  one  type  of  stationary 


» 


deterministic  policy  is  required.  A stationary  deterministic  policy 
is  said  to  be  a connected  policy  if  the  state  space  X can  be  sub- 
divided into  disjoint  connected  regions  Xfl,  a ■ 1,  2,  . ..,  K,  such 
that  tt  specifies  action  a is  to  be  taken  if,  and  only  if,  the  system 
is  observed  in  state  x e Xfi.  The  following  lemma  establishes  that  the 
a-optimal  policy  is  a connected  policy. 

Lenina  3.5 

For  two  data  compressors,  A ■ [1,2],  the  a-optimal  policy, 
a e (0,1),  is  a connected  policy.  This  policy  specifies  that  data 
compressor  one  is  selected  when  the  waiting  time  w of  a message  satis- 
fies w £ Ta  and  otherwise  data  compressor  two  is  selected.  T^  is  term- 
ed the  threshold  and  is  selected  to  minimize  the  a-discounted  cost  prob- 
lem. 

Proof 

From  Lemma  3.4  any  dependencies  on  the  interarrival  times  can  be 

disregarded.  So,  substituting  the  value  of  c((w,r),a),  equation  (3.24), 

and  the  expression  for  Pr(w  < W,  t ..  < T|(w  ,t  ),a  ),  equation 

irrx  — irt\L  — n n n 

(3.23),  the  policy  Improvement  algorithm  (3.45)  becomes 

rrw+t 

w+t  +yD  +a|  8 U (w+t  - w)F  (dw) 

a a LJq  n,a  a t 

+ 0 (0)  (1  - Fx(w  + ta))]}  (3.79) 

where 

UQ(w)  - 0 (3.80) 


min 

ae[l,2] 
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for  w e [0,“).  Now  define 


0 


(. 


c 


r 


r 

# 


( 


< . 


I 


0 <v)  = f U (w -0)  F (dw)  + U (0)  (1  - F (w)) 

n,a  Jq  n,a  t nfa  t 


(3.81) 


and  so  (3.75)  becomes 


U (w)  **  min  {w+t  + pD  + a U (w  + t )}  . (3.82) 

nTljOl  r«  ni  A 3 3 

ac[l,2] 

In  order  to  find  ir*  It  is  necessary  to  find  the  behavior  of 

U (w)  that  satisfies  (3.82).  First,  it  is  required  to  show  that 
n,a 

U (w)  is  a nondecreasing  function  of  w for  all  n > 0,  and  for  all 
n,a  — 

a e (0,1).  Clearly, 


U,  (w)  = w + min  (t  + jjD  ) 

X jfll  r . n 1 3 3 

ae{1.2] 


(3.83) 


and  so  U.  (w)  is  a nondecreasing  function  of  w,  for  w e [0,”). 
l,a 

Now  suppose  U (w)  is  a nondecreasing  function  of  w.  Then  from 

II  9 Cl 

(3.81)  0 (w)  is  a nondecreasing  function  and  so 

n,a 

t + pD  + aU  (w+t  ),  a = 1,2,  are  nondecreasing  functions.  Thus, 

3 3 3 9 & 3 

the  lower  envelope  of  (w  + t + pD  + a U (w  + t ),  a ■ 1,  2), 

3 3 3 9 01  3 

which  is  Un+^  a^*  a non<*ecreasln8  function  of  w.  Therefore, 
by  induction,  U (w)  is  a nondecreasing  function  of  w for  all  n > 0 

II  9 Cl  " 

and  a e (0,1). 

Now  to  show  that  the  a-optimal  policy  is  a connected  policy 

with  threshold  T , define  for  w < 0 
a 


U 

n,a 


(w) 


U 

n,a 


(0) 


(3.84) 
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un  (v)  - ft  (o) 

n,a  n,a 


Furthermore,  define  for  w e (-“>,“) 


AU  (w)  - 0 (w  + t.)  - U (w  + t.) 
n,a  n,a  1 n,a  2 


4Cn,.(")  ’ 0„,a(”  + *1>  - + t2>  ' «-8,> 


By  (3.81)  aO  (w)  satisfies 
n,a 


+ ‘l  * »)  - + '2  - *>!»,«*> 

■ j AU_  “ w)F  (dw)  . 

Jn  n»°  T 


(3.88) 


Clearly  from  (3.82),  at  stage  n+1  the  optimal  policy  is  to  select 
data  compressor  1 if  the  waiting  time  w satisfies 


tx  - t2  + - D2)  + a AU  (w)  < 0 


and  otherwise  use  data  compressor  2.  By  (3.80) 


(3.89) 


AU0  ” 0 (3.90) 

for  all  w c (-«,«),  which  implies  that  the  optimal  policy  for  the 
1 stage  problem  is  a connected  policy  with  threshold 


T,  - 0 or  » . 

1,0 


(3.91) 


'W 


m 


■~f'TT  ' 


♦ 

1 


o 


Now  suppose  AU  (w)  Is  a nondecreasing  function  of  w,  for  all 
n,a 

w e (-00,®),  and  for  all  o e (0,1).  By  (3.88),  AU  (w)  is  a non- 

n,o 

decreasing  function  of  w.  Hence,  since  t^  - + p(D^  - D^) 

+ a AU  (w)  is  a nondecreasing  function  of  w,  equation  (3.89)  implies 
n,a 

that  the  optimal  policy  for  the  first  stage  of  the  n+1  stage  problem 

with  discount  factor  a is  a connected  policy  with  a threshold  labelled 

T . , > 0".  So,  from  (3.82)  and  (3.86),  AL  . (w)  is  given  by 

n+l,a  — nri.a 


, if  w < -«- 


^Vl,a(wftl>-Vl,a(0)  ’ lf  -C1  < W 1 _t2 


AU  . (w)  “<  t1-t.+aA0  (w+t.)  , if  -t,  < w < T . - t. 

n+l,o  X 1 2 n,a  1 * 2 — n+l,a  1 


Fvi(D2  - Dx) 


, if  T , , -t,<  w<T ' , -t 

* n+l,a  1 — n+l,a  2 


^trt2'taa(,,,.a<'rtt2)  • lf  Vl.a  - C2  ' 


(3.92) 


Since  U (w)  and  Af)  (w)  are  nondecreasing  functions  of  w and 
n+l,a  n,a 

clearly  AU  , , (w)  is  nondecreasing  between  regions,  then  AU  ..  (w) 

nrrl,a  wri,u 

is  a nondecreasing  function  of  w.  Thus,  by  induction  AU  (w)  and 

n,a 

A 

AU  (w)  are  nondecreasing  functions  of  w for  all  n > 0 and  all 
n,a  — 

a c (0,1). 

Now,  from  (3.46) 


A V„(w,n*)  - V>  + tv  »a)  - V>  + t„  **) 


a a 


u2*  a 


lira  AU  (w) 
n,o 


(3.93) 
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f 


r 


! 


k 1 


and  so  AV  (w,tt*)  is  a limit  of  nondecreasing  functions.  Thus, 
a a 

AVa(w,ir*)  is  a nondecreasing  function  of  w.  Furthermore,  from 
(3.A4)  and  (3.89),  it*  is  the  policy  which  selects  data  compressor  1 
if  the  waiting  time  w satisfies 

tx  - t2  + p(D1  - D2)  + o | AVa(w-w)Ft(dw)  < 0 (3.94) 

and  otherwise  utilizes  data  compressor  2.  But  the  integral  in  (3.94) 

is  a decreasing  function  of  w,  and  so  it*  Is  a connected  policy  with 

a threshold  to  be  labelled  T > 0 . 

a — 

Q.E.D 


Thus,  the  a-optimal  policy  has  been  shown  to  be  a connected 
policy  specified  by  a threshold  Tq  >_  0 . Since  all  connected  policies 
of  this  form  with  negative  thresholds  are  equivalent  (policies  which 
always  use  data  compressor  2),  then  assume  Tq  is  bounded  by  Tq  -1. 
The  following  lemma  proves  that  Tq  is  upper  bounded  for  all  a c (a*,l) 
where  a*  e (0,1). 

Lemma  3.6 

The  threshold  Tq  specified  in  Lemma  3.5  is  bounded  for  all 
a e (a*,l)  by 


Ta<Mr 

where 
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(3.95) 


* 


) 


f 


t 


and 


.-1 


Hj.  “ max(A  , (tx  - t2) 


In  2 
A in  n* 


) 


(3.96) 


Proof 


Let  n(T)  be  the  connected  policy  with  associated  threshold  T. 
Let  Tj  and  be  such  that 


Tx  + 2(tx  - t2)  + p(D2  - Dx)  < T2 


(3.97) 


1 2 

Furthermore,  define  the  sequences  of  random  variables  (w  } and  {w  } 

n n 

by 


J 

n+1 


max( 


+ *1  - w 

9 

if  w < TJ 
n ~ j 

, J - 1,  2 

+ C2  ■ ’n+l* 

9 

if  w > T. 
n j 

(3.98) 

where  for  J ■ 1,  2 


t 


C 


0 


f 


wj  » W , (3.99) 


and  {Tn>  is  a sequence  of  Independent  identically  distributed  ex- 
ponential random  variables  with  mean  A-1.  Thus,  the  sequences  defined 
in  (3.98)  are  the  sequences  of  waiting  times  under  policy  n(Tj)  and 
w(T2).  Then 


N-i 

,(w»*T  ) - lin  sup  l a11  E{c,  (w") } 
J n-0  2 n 


(3.100) 


where 


Cj(w)  - w + (tj  + pDx)  I(w  <_  Tj)  + (t2  + pD2)  I(w  > Tj)  . (3.101) 
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To  show  that  va(w*nT  ) is  smaller  than  V^w.i^  ),  first  It  is 

required  to  show  that  w(1^  < w^2)  for  all  n > 0.  Suppose  w1  < v2. 

n — n — n — n 

Then  if  w2  < T2,  clearly  w*+1  < w^  by  (3.98),  and  if  w2  > T2 

1 12 

and  Wn  > ^1*  a8ain  wn+i  1.  wn+i  hy  (3.98).  The  case  remaining  is  for 
1 2 

wn  1.  and  wn  > t2*  For  this  case  by  (3.97)  Wj  + t^  <_  w2  + t2» 

and  so  < w^.  Thus,  if  w*  < w2  then  w^  < w2+r  Therefore, 

1 2 

by  induction  w < w holds  for  all  n. 
n — n 

Now  let 

max{0,  sup(n:  n <_  n and  wi  ■ 0))  , if  n > 0 

N - 
n 

0 , if  n - 0 (3. 1C 


1 if  w1  > T, , w2  < T„ 

n — 1 n 2 

A - -1  if  wi  < T.,  w2  > T0  . (3.103) 

n n i n — z 

0 otherwise 

1 > 

Clear  if  m satisfies  N < m < n,  then  w + 0 which  implies  w * 0. 

n — m m 

Thus,  solving  the  recurrence  relationship  (3.98),  w^  is  given  by 

+ I U.  Kw^  < T.)  + t,  I(J  > T.)  - t . 

nm-N  lmj  2m-j  nri-1' 

” (3.104) 

Then  using  (3.103)  and  (3.104),  the  difference  between  w^  and  w^ 

n n 

is  given  by 


2 

Aw  - w - w 
n n n 


4\  +i  *»(t‘ ' **» 


(3.105) 


It  easily  follows  that  if  N l4  n and  Aw  , > T-  - T,  - (t,  - t_), 

n n-1  — 2 1 1 2 

then  Aw^  >_  7^  - - (t^  - t2).  Hence,  by  induction,  when 

2 

Awn  1 T2  " T1  " (tl  " t2)»  then  Awn  - T2  " T1  " (tl  " t2)  and»  by 
n 

(3.97)  and  (3.101)  the  difference  in  costs  is  bounded  by 


c,(w2)  ~ ci(wb  > 0 
z.  n in  — 


(3.106) 


In  a similar  manner,  if  Aw^  < T2  “ Tl  “ ” c2 )»  equation  (3.105) 

n 

implies 


n-1 

I A > 0 . 

u m — 


(3.107) 


m-N 


Now  define  the  sequence  B by 


lf  AwN  1 t2  ~ Ti  ~ (tx  “ t2) 


v n 


otherwise  . 


(3.108) 


So,  from  (3.101),  (3.105),  (3.106),  the  difference  between  the  cost  is 
bound  by 

Ac(w2,w*)  - c.(w2)  - c.(w*) 
n n L n In 

n-1 

- { X Bm(tl  " t2)  + Bn(tl  _ t2  + p(Dl  " °2)}  * <3*1°9> 

m-N 

n 

Then  multiplying  (3.109)  by  an  and  sum  over  n,  the  following  bound  is 
obtained : 


N-1 

5 

n- 


-1  _ 2 1 n n 

l a Ac(w  ,w  ) > J o ( J B (t.  - t ) + B y(D_  - D ))  (3.110) 

i-O  n n n-0  m-N  m 1 2 n 1 2 


In  order  to  Interchange  the  order  of  summation  in  (3.111)  define 

#nb. 
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N ■ inf{n  > n:  w i ■ 0} 

n n 


and  so  (3.110)  becomes 


N-l  N-l  min(N-l,ftn-l) 

l a"  Ac(w^.w^)  > l B { l om(t  - t ) + a"  u(D  - 

n-0  n n n-0  n m-n  12  1 


Hence,  from  (3.100)  it  is  dear  that 

/N-l  (®n“n) 

V«(”-”t2)  - V“'V  i \l0  «"  > ('!  - t2) 


“<D1  - »2>))  • 


To  show  that  the  right  side  of  (3.113)  is  positive,  first  define 


M by 
n 


M - inf {n  > n:  B«  0}  . 

n n 


Then,  it  is  easily  shown  if  n is  select  such  that  B = 1 and  B = -1, 

n M 

* n 
then  M < N • Hence, 
n n * 

-n 

a°  (tl  - t2)  + “»!  - D2» 

M (ftn'Mn) 

+ “ 11  ®M  - a > <t1  - t2)  + p(D1  - D2)}  > 0 . (3.115) 

n 

Furthermore,  it  is  clear  that  if  n is  selected  such  that  B - -1, 

n * 

then  B • 1.  Thus,  for  every  n such  that  B*  « -1  there  exists  a 
n n 

n < n such  that  Bn  • 1,  Mn  - n,  and  (3.115)  is  satisfied.  Now  define 

the  sequence  by 
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0 


-1 


Q 


C 


C 


c 


c 


0 

B 


if 

Bn"1’ 

if 

B - -1 
n 

M 


n 


otherwise 


(3.116) 


Thus,  using  (3.115),  (3.113)  can  be  further  lower  bounded  by 

N-l  ®n_n 

V«(w'ffT  J _ V«^w*nT  > i 11,0  8UP  E I “n  C {(•— — — -)  (t.  - t ) 
“ T2  “ T1  N~>  n-0  n 1 - a 12 

+ m(dx  - d2)}  . 


(3.117) 


Clearly  from  (3.116),  C » 0 or  1,  Thus,  to  prove  the  right  side 

n 

of  (3.117)  is  positive,  it  is  sufficient  to  prove  that  for  all  n >_  0 


E{(1~T-  a— > (tj_  " t2)  + p(Dx  - D2)|Cn  - 1}  > 0 


(Nn-n) 


(3.118) 


Using  Jensen's  inequality,  a lower  bound  to  the  right  side  of  (3.118) 
is  given  by 


(fin-n) 

- . > <£1  - ‘2’  + "<D1  - VlCn  ’ 11 

E(fin-n|Cn-l) 

- (1~°  1 - a > (tl  _ t2)  + u(Dl  ' V ‘ (3.119) 

Now  E(Nn  - n|Cn  ■ 1)  can  be  lower  bounded  by  examining  the  sequence 


3. , - 3 - t 
n+1  n n+1 

where  w„  - T,  - (t,  - t„)  and  (t  } is  a sequence  of  Independent 
u 1 1 L n 

Identical  distributed  exponential  random  variables  with  mean  X 
Hence,  if 


(3.120) 
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(3.121) 


N ■ lnf{n  >_  0:  U ^ <_  0}  , 


then  it  Is  easily  argued  that 


E(N)  < E(ft  - n|c  - 1) 
— n n 


(3.122) 


But  N is  a Poisson  random  variable  with  mean  X(T^  - (t^  - t^)). 
Hence,  using  (3.122)  in  (3.119),  It  Is  found  that 

(Vn) 

ECC1';--.  a ) (tx  - t2)  + y(Dx  - D2)|Cn  - 1} 


x(t  -(trt2)) 

i > 

1 - a 


-)  (tx  - t2)  + p(D1  - D2)  (3.123) 


Nov  let 


(0  i - (tl  ~ t2-  . . ) 
\ 2u(D2  - Dj)/ 


a*  ■ max 


Ti  " Bax(X’1’  (tj.  ‘ t2)  + < 


So,  It  is  easily  shown  for  a e (a*,l). 


x(Tr(trt2)) 

* _ ^ 

1 - a 


-)  (t1  - t2)  + p(D1  - D2)  > 0 . 


Hence,  from  (3.126)  and  (3.123),  equation  (3.118)  is  valid 
for  a e (a*,l)  when  a*  is  given  by  (3.124)  and  is  given  by  (3.125). 
Then  substituting  (3.118)  into  (3.117)  it  is  clear  that  for  a c (a*,l) 


V (Wi”,  ) - V (w,it  ) > 0 
a * T2  a'  * — 
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Thus*  can  not  be  a-optlmal,  and  so,  from  (3.97)  the  a-optimal 
threshold  Tq  must  satisfy 


C 


T < T + 2(t-  - t,)  + p(D,  - D.)  < 


(3.128) 


Q.E.D 


c 


Before  proceeding  with  the  verification  of  Assumptions  3.2 
and  3.3,  consider  the  sequences  of  random  variables  generated  by 

the  following  equations:  for  n > 0 


wn+l  * Sn  + '2  - Vl> 


(3.129) 


and 


° 


n+1 


, if  w ■ 0 and  t . , > z + t_ 
n n+1  2 


+ z , otherwise 


(3.130) 


JJ  A A H 

where  w^  - 0 >_  0,  w^  - w^  + z,  {t^}  is  a sequence  of  Independent 
identically  distributed  exponential  random  variables  with  mean  A-*, 
z is  an  arbitrary  number,  and  t2  is  given  by  (3.22).  The  sequence 

ft 

of  random  variables  {w^}  can  be  considered  a sequence  of  waiting 
times  of  a M/D/1  queueing  system  with  service  time  t2  and  mean 
Interarrival  time  equal  to  A \ Thus,  the  sequence  of  random  variables 
(w  } i®  observed  to  be  the  shifted  version  of  {w  },  except  that  after 

“ T1 

A 

two  consecutive  values  of  w are  zero,  w may  transition  to  a state 

n n 

labelled  0. 

Now  consider  the  sequences  of  waiting  times  in  two  separate  M/D/1 
queueing  systems  which  use  connected  policies  with  thresholds  T^  and 
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T. 


T_.  Let  (w  and  {w  be  the  sequences  of  waiting  tines 

L n n 

when  threshold  Is  used  where  w^®*^  - 0 and  w^"*"^  - w.  Furthernore, 
(0  2)  (1  2 

let  (w  * } and  (w  * )}  be  the  waiting  tine  sequences  when  threshold 

n n 

is  used  where  w^®*^  ■ 0 and  Wq1'^  - w.  So,  from  (3.21)  the 
recurrence  relationship  for  the  waiting  tine  sequences  are  given  by 

nax(0,  w^1*^  + t-  - r ,)  . if  w^1’^  < T 
n i n+i  n — j 

nax(0,  w*1*^  + t,  - T ..)  , If  w*1*^  > T 
n z n+i  n j 

(3.131) 


/i.J)  . 

n+1 


where 


and 


.(0,j) 


J - 1,  2 


w ^ 0 , j - 1,  2 


(3.132) 


(3.133) 


and  (t  ) is  the  sane  sequence  as  In  (3.129).  The  following  lemna 
n 

shows  w of  (3.130)  upper  bounds  w^1*^ . 
n n 


3.7 


For  the  sequences  defined  by  (3.130)  and  (3.131),  for  all  n ^ 0 


< w 

n — n 


1 - 0.  1, 


J - 1,  2 


(3.134) 


where 


*’Tl+tl  • 
-<°*1)  - *<°’2)  - 0 . 
(1,1)  . (1,2)  . 


(3.135) 

(3.136) 

(3.137) 


I 


and 


wQ  “ max(0,  w-z)  . 


(3.138) 


Proof 

The  proof  Is  by  Induction.  First,  fix  i and  j.  From  (3.136), 
(3.137)  and  (3.138)  it  is  clear  that 


< C . 
wo  - wo 


(3.139) 


n 


\ + H - Vi  - 


. Then  for  w > z, 
i n — 

W(i’j)  + t - T 

wn  + tl  Tn+1  * 

if 

W<M>  < T 

n — 

w(1’J)  + t - x 
wn  C2  n+1  * 

if 

w<m>  > T 

n 

(3.140) 


since  + t,  < z when  < T..  Then,  using  (3.131), 

n X n — j 


max( 


<®n  + '2  - Vl-  ,H'- v-.,:l) 


n+1 


(3.141) 


and 


max(wn  + t,  - t^,  0)  > v^>  . 


n 2 ‘n+1 

Thus,  from  (3.129)  and  (3.130) 


w > 
n+1  - n+1 


(3.142) 


For  the  remaining  case  of  w >0,  equation  (3.139)  gives  w^1’^  - 0. 


Then  from  (3.129)  and  (3.130) 
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•vs 


*£*|  • 
m 


it 


w(i,j)  « 

itfl  - 


i°  • lf  Vi  - 0 
2 • lf  Vi^2 


(3.143) 


and  so. 


- > w(i.J) 

tr+1  - n+1 


(3.144) 


Consequently,  by  Induction  (he  result  (3.134)  Is  achieved. 


Q.E.D 


Using  random  variables  {vq}  the  average  waiting  time  of  the 


queueing  systems  can  be  bounded.  In  order  to  bound  the  performance 

of  the  system.  It  is  required  to  find  the  mean  number  of  steps  T^w) 

for  the  sequence  of  random  variables  {w  ) to  reach  state  0 when 

n 

WQ  - max(0,  v-z). 


3.8 


For  w ^ z,  $z(v)  satisfies 


T^(w)  • 1 + [e*2  + X(w-z)]  (1  - Xt2>  * (3.145) 


Proof 


Define  the  sequence  (Sn)  by 


s« ' J,  (T‘  - V 


n > 1 


(3.146) 


where 


so.o 


(3.147) 
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M. 


and  {t  > is  Che  sane  sequence  as  in  (3.129).  Let  N be  the  stopping 
n 

time  given  by 


N - lnf{n:  S > w-z) 

n “ 


(3.148) 


which  is  the  number  of  steps  to  get  from  ■ w to  state  z.  Wald's 
equation  gives 


E{  l (x  - t,)}  - E(N)  E(x  - t ) 
i-1 


(3.149) 


But  by  appropriately  adding  and  subtracting  v-z,  it  is  obtained  that 


N 


E{  l (Tn  - t2)>  - 


n-1 


w-z 


+ e|e(tn  - [w-z-  l (xn  - t2)  + t2]|N,{xn,  n - 1,2, . . . ,N-l}^j 
n-1 

(3.150) 


Using  the  memory less  properties  of  an  exponential  random  variable,  the 
Inner  expectation  of  (3.150)  becomes 

t N-1  \ _i 

e|th  - [w  - z - l <Tn  — t2>  t2)|N,{xn>  n - 1,2 N-1) J - X . 

n-1 

(3.151) 


Hence,  equation  (3.150)  becomes 


N 


E{  l (x  - t.)}  - v - z + X-1  . 


n-1 


n 2 


(3.152) 


Thus,  using  (3*149),  the  mean  mnber  of  steps  f^(w)  to  go  from 


«0  - v to  z is  given  by 
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) 


^(w)  - E(N) 


W - S + A 

X”1  + t„ 


-1 


(3.153) 


Now  let  p be  defined  as 


P " Pr{Vl  ■ °l"n  " Z) 


-Xz 


(3.154) 


Furthermore,  the  mean  number  of  customers  served  N in  a busy  period 

B 

A 

of  the  M/D/1  queueing  system  generating  the  sequence  {w  } Is  equal 

n 

to  (Cohen  [36]) 


E(Nb)  - (1  - t2X) 


-1 


(3.155) 


So,  the  mean  number  of  steps,  T_ , to  go  from  0 ■ z to  state  0 Is  given 

6 n 

by 


f,  - 1 + P I (l-p)“  n E(N  ) 
n-0  B 

- 1 (eA*  - 1)  (1  - t2X)_1 


(3.156) 


Finally,  since  t^v)  Is  the  sum  of  t^(w)  and  f 2,  f^(w)  Is  given  by 
ff(v)  ■ tx(w)  + 


1 + (eX*  + X(w-s))  (1  - t2X)-1 


(3.157) 

Q.E.D 
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i 


I 


« 


I 


I 


1 


3 


t 


■) 


) 


1 


Assumption  3.2  can  now  be  verified. 


Lemma  3.9 


For  a*  ■ max 


(“• 1 - “d 


for  all  o e (o*,l). 


lva(w,if*)  - Va(w,x*)|  lL(w) 


(3.158) 


where 


X(M_+t  ) 

L(w)  - (1  + (e  1 A + Xw)}  (1  - t2X) 


(w  + 2(tx  - t2)  + p(D2  - D1)}  (3.159) 


and  Hp  is  given  by  Lemma  3.6. 


Proof 

Let  a*  « raax^O,  1 - 2p(D  - D ')")  and  “ e Conslder  the 

sequences  {w^i,;^}  defined  in  (3.131)  where  the  parameters  and 
are  given  by  - Up,  T2  - Tq  and  M,p  is  given  in  Lerana  (3.6).  So, 
emma  ! 

(i,J) 


Lemma  3.7  Implies  that  if  w^  * 0 then  for  i * 0,  1 and  j ■ 1,  2, 


w- - 0.  Furthermore,  let 
n 


■ inf{n  >0:  w ■ 0} 

— n 


(3.160) 


and  so,  from  (3.131) 


rU.J)  _ .0  for  n - ft,  ftfl,  ... 


(3.161) 


Since  T«  - T and  T is  the  threshold  generating  nj,  then  (3.100)  and 
Z o a a 
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1 


the  definition  of  c^(w)  in  (3.101)  Implies 

ft 

Vw’*5>  - V"’7#  “ E l “n{c2(wn1,2))  - c2(wi°’2))} 

n*0 


(3.162) 


Mow  it  is  easily  observed  that 


(3.163) 


and  so 


lc2(wn1,2))  - c2<wi°,2)>  I 1 w + 2<t1  - t 2)  + P(D2  - Dl)  . (3.164) 


Thus,  (3.164)  is  bounded  by 


ft 


n.  ,„(1,2) 


!Vw’nS>  - ±E<  I “ U2(V 

n-0 


< E(N)  (w  + 2(tx  - t2)  + u(D2  - Dl)) 


< (®ax(w,MT  + tj)) 


• (w  + 2(tx  - t2)  + m(D2  - Dj^)} 


(3.165) 


Using 


3.8  for  the  expression  for  T (w), 

z 


, . XOL+t  ) 

|va(w,»S>  - Va(0,n*)|  1 t1  + le  + X (max(w  - H,.  - ^,0))} 


(1  - Xt2)_1{w  + 2(tx-t2)  + p(D2  - D1)} 


and  (3.158)  is  achieved  when  it  is  noted  that 


(3.166) 


max(w  - Hy  - t^.0)  <_  w 


(3.167) 

Q.E.D 
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and  so,  Pr(wn  - 0)  approaches  a limit,  as  n + »,  which  is  denoted 

by 

- lim  Pr(wn  - 0)  . (3.170) 

Let  the  sequences  (w  } and  {w^*^}  be  defined  as  in  Lemma  3.7 

n n 

with  T^  = T and  T2  - 0.  Then  Prfa*1’1*  £ W),  W £ 0,  is  obtained  by 
conditioning  on  the  events  w^  - 0,  l = 0,  1,  ...,  n,  and  so, 

Pr(w'[i,1)  < W)  - l Pr(w(i,1)  < W;  w > 0,  l - k+1, . . . ,n-l|w.  - 0) 

k-0  n % k 

• Pr (w  - 0)  + Pr(w(1,1)  < W;  w > 0,  i « 0,1,..., n) 

(3.171) 


To  show  that  the  limit  as  n -*■  « of  Pr(w^**^  £ W)  exists, 
first  consider  the  quantity  P given  by 


Pn  " > °»  1 m 1»2 n-l|w0  - 0) 

QO 

* I Pr(wk  “ 0;  w > 0,  l - l,2,...,k-l|w  - 0)  , (3.172) 
k»n  K * u 


for  n > 1 and  P^  - Pq  = 1.  Hence,  it  is  clear  that 

Pr(w*i,J)  < W;  wf  > 0,  l - k+1, . . . ,n-l|wk  - 0)  £ Pn_k  (3.173) 

Now  suaming  P over  n gives 
n 

N 

I P - N l Pr(0  - 0,  w > 0,  l - 1,2 i-l|w_  - 0) 

n-1  n k-N+1  * 1 0 

N 


+ I n Pr(w  - 0,  w > 0,  l - 1,2 n-l|v-  * °)  • 

n-1  n 1 0 

(3.174) 
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0 back  to  state  0 


But  the  mean  number  of  steps  T to  go  from  « 

Is  given  by 

N 

t - lim  £ n Pr(w  * 0;  w > 0,  *,  - 1,2, . . . ,n-l|wQ  ■ 0)  (3.175) 

n-1  " 

and  T is  finite  from  Lemma  3.8.  So,  using  (3.174)  and  (3.175), 


N 

t - l P - l (n-N)  Pr(w  - 0,  w > 0,  l 
n-1  n n-N+1  n * 


1, 2, • • . , 1~1 | “ 0) 
(3.176) 


Taking  the  limit  as  N ■>  ® of  (3.176)  and  noting  that  the  summation 
in  (3.175)  has  a limit,  f is  given  by 


N 

T - lim  7 P 
N-**>  n**l  n 

Thus,  for  every  c > 0 there  exists  a N(e)  such  that 


(3.177) 


f P < e (3.178) 

k-n 

for  N > n >_  N(e),  and  clearly,  from  (3.170)  for  ever)’  e > 0 there 
exists  a M(e)  such  that 


|Pr(wn  - 0)  - PQ|  < c 


(3.179) 


for  n £ M(e)  . 

Now  define  F„(W)  by 
N 

PN(W)  - l Pr(w^0,1) 
" k-0  K 


£ W;  w > 0, 


l 


l,2,...,k-l|w<0,1)  - 0). 

(3.180) 


Then  from  (3.171)  and  (3.173)  the  difference  between  Pr(w(1.,.1)  < W) 

ntN  — 

and  F (W)  is  bounded  by 
N 


Ipr<“^1>  iM>  - V»>l  i ? I«W  ' 0)  - polp 

«C*U 


+ l Pr(w  . =0)  p.  + Pr(w^i,1)<^  W;  v,  > 0,£  » 0,1 n+N) 

k-N+1  N+T1'k  * n 

(3.181) 


From  Lemma  3.8  it  is  easily  established  that  for  all  e > 0 there  exists 
a function  K(e)  such  that  for  all  n > K(e)  and  i = 0,1 


Pr(w(i,1)  < W;  w > 0,  l = 0,l,...,n+N)  < e 


(3.182) 


Now  let  N > max(N(e/3) , M(e/3f),  K(e/3)),  then  using  (3.178),  (3.179) 
and  (3.182)  in  (3.181),  it  is  clear  that  for  n > 0 and  i = 0,1 
/j  i \ N 

lPr(wrH-N  ^ < " Vw>l  1 l e V3^  + e/3  + e/3 

k=0 


< e 


(3.183) 


where  the  last  inequality  is  derived  from  (3.177).  Thus,  the  limit 
of  the  distribution  Pr(w^*’^  < W)  exists,  is  independent  of  i which 
implies  it  is  Independent  of  w^*^,  and  is  denoted  by 

F(W)  = lim  Pr(w(i,1)  < W) 


(3.184) 


To  show  that  F(W)  is  a valid  distribution  function,  consider 
lim  Pr(wn  ± W).  It  is  easily  shown,  using  Lindley’s  Theorem,  that  this 


limit  exists  and  is  a valid  distribution  function  which  is  denoted 


P(W)  ■ lim  Pr(w  < W) 
n — 


(3.185) 
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But  from  Lenma  3.7 


for  all  n,  and  80 


n — n 


F(W)  > F(W) 


Since  11b  *(W)  - 1,  by  (3.187)  lim  F(W)  - 1.  Hence,  F(W)  is  a valid 

W-*» 

distribution  function. 

The  limit  of  the  mean  of  as  n -*■  « is  shown  to  exist 

n 

by  noting  that 


E(w(i,1))  - [ (1  - Pr(w(1,1)  < W))  dW 

n I ~ n — 


E(w  ) - (1  - Pr(w  < W))  dW 

n ]_  n — 


From  (3.186)  and  (3.187)  for  W e [0,«) 


1 - Pr(w  < W)  > 1 - Pr(w(1,1)  < W)  > 0 

n — n — — 


and  hence 


1 - f(W)  > 1 - F(W)  > 0 


But  clearly,  E(w  ) < « for  all  n,  and 
n 


lim  E(w  ) - f (1  - fr(W)) 
xr-  n Jo 


dW  < • . 


Hence,  using  a version  of  the  dominated  convergence  theorem  (Royden 
[37],  p.  89) 


ms'T 


11m  E(v 


■ l 


(1  - F(W))  dW  < - 


Q.E.D 


Finally  Assumption  3.3  can  be  verified. 


There  exists  an  Increasing  sequence  {a^},  e (0,1),  such  that 

a +1  and  11m  T exists.  Furthermore,  the  connected  policy  it 
n a 

n-*»  n 


with  threshold  f given  by 


T - lim  T 

a 

n 


satisfies 


♦(w,n)  - 11m  sup  (1  - a^)  Va  (w,n*)  , 

n*“  n 


for  all  w c (0,»). 


Proof 


From  Lemma  3.5  and  Lemma  3.6,  there  exists  a a*  e (0,1)  and  a 


such  that 


-1  <_  Tq  < < »•  for  all  o e (a*,l) 


Since  Tq  Is  bounded  for  all  a e (a*,l),  there  exists  an  Increasing 

sequence  (a  },  a e (0,1),  such  that  ail  and  11m  T exists. 

n n n a 

ir**  n 


Let  5 be  the  connected  policy  with  threshold  f given  by  (3.194), 


for  such  a sequence  {a  ). 

n 


r 


I 


Now,  for  all  a e (0,1) 


|<t>(w,n)  - (1  - o)  Va(w,ir*)|  <_ 


|*(w,n)  - (l  - a)  Va(w,it)|  + (1  - a)  |Va(w,ir)  - Va(w,ir*)|  . 


(3.197) 


Using  I.emma  3.10  and  (3.39) 


*(w,ir)  ■ 11m  E*{c(w  ,a  ) w.  » w} 
n-w»  " n n 0 

- lim  tE-(wn>  + fc2  + uD2  + (^  - t2  + w(D1  - Dj)  Pr(wn  _<  f)}  . 

n-*«o 


(3.198) 


* 


From  (3.37)  for  all  N'  > 0 

N 


N'-l 


|V  (w,w)  - 11m  sup  £ an  4>(w,ir)|  < [ a"  E»(c(w  ,a  ) |w  - w) 

° N-*®  n-N'  n-0  * n n 0 

N 

+ 11m  sup  l an| E*(c(w  ,a  )|w  - w)  - *(w,w) | (3.199) 

n-N’  * « n 0 

But  from  (3.198),  for  every  e > 0 there  exists  a N(e)  such  that  for 
all  n > N(e) 


| E»{c(Wn»an) | Wq  * w}  - $(w,w)|  < e 


(3.200) 


Also,  Lemma  3.3  Implies  that  for  all  N > 0,  the  sequence  B„  defined 

— N 

by 


B - max  E« (c(w,a) |w  - w) 
N 0<n<N  " n n 0 


(3.201) 


Is  bounded.  Thus,  using  (3.200)  and  (3.201)  In  (3.199),  the  following 
bound  Is  found  when  N'  > N(e)s 
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N'  , N'  N' 

lVw’ff)  - - V-  V + r ~ e 


(3.202) 


Hence,  from  (3.202), 

|(1  - a)  V (w,r)  - *(w,<r)|  < (1  - aN’)  (*(w,S)  + B„,>  + aN'  e . 


(3.203) 


Now  consider  the  sequences  of  random  variables  {w  } and 

n n 

which  are  discussed  in  Lenina  3.7,  where  • max  (t,Ta>, 


” min  ($,T  ),  and  a e (a*,l).  So,  from  (3.100) 


N 


V (w,tt  ) - lim  sup  I an  E{c  (w^1*^)) 

“ Tj  N~>  n-0  j n 

where  c^(w)  is  given  by  (3.101).  So,  the  difference  between  the 

a~discounted  cost  under  n*  and  n is  bounded  by 

a 


(3.204) 


N 


|V  (w,ir*)  - V (w,ir)|  < lim  sup  l a E{|c1(w^  * *)  - c?(w*1,2))|>  . 
a N-**>  n«0  Zn 

(3.205) 


Now  define 


and 


w(l,l)  „ w(l,2) 

and  T„  < w*lfl)  < T,} 

, (3.206) 

n n 

2 n — 1 

N.  s w - 0} 
l n 

1 * 0|1|2|« • • 

(3.207) 

.«-2>  »a  i,  < .<i-i> 

< T.}  . 

i+1  n 

n 2 n 

— 1 

i m 1,2,... 

(3.208) 

, If  n satisfies 

- n < nl’  then  the 

difference 
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in  waiting  times  is  bounded  by 


,a.i>  _w(i.» | 


and  so 


(1,2), 


In  a similar  manner,  for  a fixed  value  of  i,  if  n satisfies 
n^  £ n < then  the  difference  in  costs  is 

lcl(wn1,1))  “ c2(wi1,2))l  " 0 • (3.211) 

Thus,  (3.205)  can  be  written 

N ^i" ^ 1 

I va (w»ir*)  - V (w,#)|  < lim  sup  l E l an|c.(w(1»1))  - c,(w(1»2))|| 

N~°  i-0  n-N  1 n 2 n | 


N ni_1 

< lim  sup  E l an  [T.-T,  + 2(t.-t.) 

i-0  n-N,  1 1 12 


+ p(D2  - D1)] 


Now  clearly. 


l n Ni 

E<  l “ IV  1 o 1 E(nA  - 1 - NjN  ) i - 0,1,2,...  (3.: 

n-Nj 

and  E(n^  - 1 - N^|n^)  Is  equivalent  to  the  mean  number  of  steps 
to  go  from  w*1,J)  - w,  where  T2  < w _<  to  wr  - 0.  So,  using 
Lma  3. A and  (3.213), 
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Now  substituting  (3.203)  and  (3.221)  into  (3.197),  the  bound 


l*(»,5)  ~ (1  “ a)  V (w,**)|  < (1  - aN')  (♦(*,£)  + BnI)  + aN'e 
u a w 

-A | t-T  | 

+ B(1  - e “ ) . (3.222) 

Replacing  a with  c»n  in  (3.222),  it  is  clear  that  for  any  e > 0 there 
exists  a N such  that  for  all  n > N 

|*(w,n)  - (1  - «n)  Va  (w,n*  )|  <_  e (3.223) 

n n 

Hence,  from  (3.223),  the  results  of  the  lemma  follow. 

Q.E.D 

Finally,  using  Theorem  3.2  the  average  optimal  policy  is 
established. 

Theorem  3.3 

For  the  SCADCP  outlined  in  Section  3.2,  which  uses  two  data 
compressors  with  associated  distortion  levels  < D2  where 

•1  c 

D2  > r (y  - p2),  the  connected  policy  v*  with  threshold  f given  by 
(3.194)  is  the  aversge  optimal  policy.  The  policy  it*  selects  data 
compressor  1 when  the  waiting  time,  w,  of  a message  satisfies  w <_  t, 
and  selects  data  compressor  2 otherwise. 
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By  LenmaB  3.3,  3.9,  and  3.11,  Assumptions  3.1,  3.2  and  3.3  are 
shown  to  be  valid.  Therefore,  the  results  stated  follow  by  Theorem  3.2 

Q.E.D 

Theorem  3.3  provides  that  a stationary  deterministic  policy 
satisfies  (3.31).  This  implies  that  a policy  exists  which  satisfies 

(3.19) .  Thus,  the  representation  for  given  in  equation  (3.16) 

is  valid.  Using  the  knowledge  that  the  policy  which  satisfies 

(3.19)  is  a connected  policy,  y^D)  is  evaluated  in  Chapter  IV. 

3.5  Conclusions 

In  this  chapter,  the  delay  distortion  relationship  for  communica- 
tion networks  was  introduced  and  an  adaptive  decompression  scheme 
was  presented.  The  delay  distortion  relationship  represents  the 
minimum  average  message  delay  as  a function  of  the  fidelity  of  the 
reproduction  of  the  sources.  For  the  adaptive  data  compression 
scheme  considered,  a functional  form  of  this  relationship  was  presented 
In  the  process  of  deriving  the  functional  form,  Markov  decision 
processes  were  introduced  and  results  pertaining  to  these  processes 
were  discussed.  The  results  presented  apply  specifically  to  the  case 
of  Markov  decision  processes  with  nondenumerable  state  spaces,  and 
unbounded  cost  functions.  Using  these  results,  for  a single  channel 
network  the  optimal  structure  of  the  adaptive  data  compression  scheme 
was  found  for  a class  of  adaptive  compression  schemes  which  employ 


two  data  compressors 


CHAPTER  IV 


DELAY  DISTORTION  RELATIONSHIP  FOR  A SINGLE  CHANNEL 

In  this  chapter  the  delay  distortion  relationship  for  a 
communication  system  employing  a single  channel  is  considered.  The 
communication  system  utilizes  an  adaptive  data  compression  scheme 
as  described  in  Section  3.2.  In  Section  4.1,  the  results  of 
Chapter  III  are  Incorporated  into  the  evaluation  of  a delay  distortion 
relationship.  This  relationship  is  specified  by  optimising  an  appro- 
priate functional.  Some  of  the  properties  of  this  relationship  are 
discussed  in  Section  4.2.  In  Section  4.3,  the  functional  to  be  opti- 
mized is  described  in  terms  of  an  Integral  equation.  This  Integral 
equation  is  solved  for  the  case  of  a data  compression  scheme  which 
usee  two  data  compressors.  For  the  case  of  a data  compression  scheme 
that  utilizes  more  than  two  data  compressors,  an  expression  for  the 
functional  is  derived  in  Section  4.4  for  a dense  set  of  values  in  the 
domain  space  of  the  functional.  Using  these  results,  numerical 
results  for  the  delay  distortion  relationship  for  various  schemes  using 
multiple  data  compressors  are  presented  in  Section  4.5. 

4.1  A Delay  Distortion  Relationship  for  a Single  Channel 

For  the  case  of  a single  conaunlcatlon  channel  and  a data  com- 
pression scheme  utilizing  two  data  compressors,  as  described  in 
Section  3.2,  Theorem  3.3  proves  that  an  optimal  policy  exists  which 
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I 


t 

I 


where  D*  is  the  average  distortion  associated  with  {D*}  and  with  the 

2 

connected  policy  with  threshold  T*  and  {D*}  and  T*  minimize  ^({D^}). 
Thus,  the  delay  distortion  relationship  reduces  to  the  minimization 
of  a functional  over  a vector  space. 

When  more  than  two  data  compressors  are  used  in  the  compression 
scheme,  K > 2,  the  structure  of  the  optimal  policy  has  not  been 
determined.  From  the  results  in  Chapter  3,  it  seems  natural  that  the 
optimal  policy  should  be  a connected  policy  utilizing  thresholds 
(T^,  k - 1,  2,  ...,  k-1)  with  respect  to  the  observed  waiting  times. 
This  policy  selects  data  compressor  k for  the  n^  message  whenever  the 
waiting  time  w^  of  the  message  satisfies  T^  ^ < wn  — w^ere 


Tq  - 0 


(A. 4) 


and 


(4.5) 


So,  restricting  the  class  of  data  compression  schemes  to  those  employ- 
ing stationary  connected  policies,  a delay  distortion  relationship, 

to  be  denoted  as  Yg(0) , results.  In  order  to  determine  y^(D),  let 

K 2 

L{T  }^Dk^  ke  defined  in  a similar  manner  as  ^({D^}).  So,  if  * 

is  a connected  policy  with  thresholds  (T^),  then 


L(ik)<(Dk»  ' L‘<(Dk»  • 


(4.6) 


Thus,  the  delay  distortion  relationship  for  connected  policies  is 
given  by 
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(l(Tk)<{Dk,))  - "D* 


(4.7) 


yJ(d*) 


min 

{0>0,k-l,2,..,K} 
(T*e>-1.V-1.2 1 


where  D*  is  Che  average  distortion  associated  with  {D*}  and  with  the 

connected  policy  with  thresholds  {T*}  and  {D*}  and  {T*}  minimize 

K *1 

L{T  Hence,  the  relationship  yr(D)  reduces  to  the  mlnimlza- 

^ 1 
tion  of  a functional  over  a vector  space.  Clearly,  given  in 

(4.3)  is  equal  to  Yj(D)  and  so,  the  remaining  sections  of  this 

chapter  are  restricted  to  the  discussion  of  y£(D). 

~1  K 

In  order  to  evaluate  y^(D),  expressions  for  ^({D^})  must 
be  derived.  In  Section  4.3  and  4.4,  the  expression  L^T  }({D^}) 
is  derived.  Once  these  expressions  are  derived,  optimization 
techniques  are  employed  to  determine  Y^(D)  in  Section  4.5. 


4.2  Properties  of  a Delay  Distortion  Relationship  for  a Single  Channel 
Some  of  the  properties  of  the  delay  distortion  relationship  Y^(D) 
defined  in  Section  4.1  are  outlined  in  this  section.  The  first 
property  is  that  7g(D)  is  a nonincreasing  function  of  the  average 
distortion  level  D.  This  is  clearly  seen  by  considering  (3.13), 
which  gives 

?J(D)  - inf  T*({D.»  (4.8) 

xelKDjtDfc})  * 

{D^p,k-1,2,..,K> 


where 

n(D;  (D.  })  -{wens  DK((D.  })  < D}  (4.9) 

K C 1*  H — 


no 


and  II  is  the  set  of  all  stationary  deterministic  connected  policies 
which  utilize  the  observed  waiting  times  as  a statistic  to  base  the 
decisions  upon.  Thus,  for  {D^}  fixed,  sets  H(D;  {D^})  over  which  the 
infimum  is  taken  and  are  decreasing  telescoping  sets  as  the  dis- 
tortion level  D Increases  (e.g.,  if  6^  < then 
n(D^;  {D^})C  R ( 62 S t } ) ) . Hence,  the  infimum  in  (4.8)  is  a non- 
increasing function  of  D. 

Another  property  of  y*(D)  which  is  of  interest  is  the  minimum 

K 1 K 

value  of  distortion,  D , such  that  y„(D)  is  finite.  So,  D 

min  K.  min 

is  given  by 


inf  D 


where 


- (D:  ?£(D)  is  finite}  . 

For  the  case  of  R ■ 1,  D*.  is  specified  in  (3.8)  to  be  r *(C/A). 

min 

For  the  case  K > 1,  the  following  proposition  establishes  the 


corresponding  values  of  D . . 

mm 


Proposition  4.1 

Let  the  rate  distortion  relationship,  r(D),  that  the  set  of 
data  compressors  satisfy,  be  a decreasing  convex  function  of  D,  and 
let  pK  be  the  protocol  information  required  to  specify  to  the  decoder 
and  the  user  which  compressor  is  used  to  compress  a message.  Then 


Dmin*  d***ne<*  (4.10).  given  by 


(4.12) 


‘ rl<C/>  ' V 


Proof 


Let  be  the  average  probability  of  using  compressor  k during 


the  compression  of  n messages,  where  is  given  by 

n-1 
-i 

rk  “ " 


^ as  a . 

p”  * n £ Pr(iC  message  compressed  by  k DC) 


(4.13) 


i-0 


Let 


tk  - (5(V  + pk)c 


-l 


(4.14) 


denote  the  transmission  time  of  a message  and  the  associated  protocol 
information  which  use  the  kC^  data  compressor.  Clearly,  the  buffer 
in  the  communication  system  behaves  as  a queue  in  a queueing  system 
with  mean  message  interarrival  time  X 1 and  average  service  time 
over  n messages  given  by 

K 


°n  " l tk  Pk 

n k-l  K K 


(4.15) 


Thus , the  average  traffic  intensity,  being  the  ratio  of  the  average 
service  time  to  mean  lnterarrlval  time,  is  given  by 


K 


■>  J tkfE  • 


k-l 


(4.16) 


Correspondingly,  the  average  distortion  is  given  by 


I ■ 


k-l 


(4.17) 


Now  assuming  that  {D^}  and  p£  can  be  selected  Independently, 
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t 
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consider  the  problem  of  determining  a set  {D^}  that  minimizes  Dn  for 
a fixed  value  of  n subject  to  the  constraint  that  the  average 
traffic  intensity  Is  less  than  or  equal  to  one,  p^  <_  1,  for  a given 
set  p£,  k - 1,  2,  K.  Using  Jensen's  inequality,  it  is  clear 

that  for  every  set  p£,  k - 1,  ....  K,  the  set  {Dfc}  that  minimizes 
Dn  subject  to  the  constraint  is  given  for  all  k and  n by 

Dk  " ?_1(C/X  " PR> 

- D*  (4.18) 

K 

To  show  that  D^n  * D*,  it  is  required  to  find  schemes  whose 
distortion  levels  are  greater  than  D*  which  have  finite  limiting 
average  delay  and  show  that  no  scheme  exists  with  finite  limiting 
average  delay  with  distortion  less  than  or  equal  to  D*.  First, 

let  D > D*  and  Dfc  - D for  k - 1,  2 K.  Then,  Dn  - D and  pr  < 1 

for  all  n,  and  the  limiting  average  waiting  time  W for  this  M/D/1 
queueing  system  is  given  by 

_ (?(D)  + p )C_1  A (r  (D)  + p )C-1 

w 5 : — T <4*19> 

1 - A(r(D)  + pK)C  1 

Hence,  from  (4.18),  the  limiting  average  waiting  time  is  finite, 
and  so,  the  limiting  average  delay  f^(D)  is  finite  for  D > D*. 

For  the  case  D D*,  the  waiting  time  of  the  (n+l)8t  message 
is  given  by  (3.21)  as 
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w - max{w  + t 
n+1  n a. 


- T 


n+1 1 


0} 


Ti+1) * 


(4.20) 


So,  w is  lower  bounded  by 


and  using  (4.16)  , 


(4.21) 


^(w^)  1 n X”1(pn  - X)  * (4.22) 

From  Jensen' 8 Inequality  if  D - lim  inf  Dn  < D*,  then  lim  inf  p > 1. 

n-»*»  n 

Thus,  if  lim  inf  p > 1,  from  (4.22)  lim  inf  E{w  } is  unbounded  and 

ir*»  n n-H»  n 

the  limiting  average  message  delay  is  unbounded.  For  the  case 

lim  inf  pn  ■ 1 and  D < D*,  D equals  D*  by  (4.18)  and  the  data 
n 

compressors  have  distortion  levels  of  D*.  Thus,  from  Llndley's 
theorem,  the  limiting  average  delay  is  unbounded.  Hence,  (4.12) 
is  valid. 

Q.E.D 

/ 


4.3  Evaluation  of  L,_  .({D,}) 

{T^)  k 


In  order  to  compute  L*T  }((Dkl)  given  by  (4.6),  an  expression 
for  the  waiting  time  probability  distribution  F(W)  must  be  obtained. 
In  this  section  an  Integral  equation  is  determined  whose  solution  is 
F(W)  and  this  Integral  equation  is  solved  for  the  case  of  two  data 


3 


3 


3 


) 
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compressors,  K ■ 2. 

Generalizing  Lemma  3.10  to  K data  compressors.  It  is  easily 
shown  that  If  the  data  compressors  are  numbered  in  order  of  Increasing 
distortion  levels  (e.g.,  <_  D^^),  with  tR  X 1 and  TR_^  < 

then  F(W)  is  a valid  distribution  function.  Since  F(W)  exists, 

L^t  }({Djc>)  is  given  in  terms  of  F(W)  by 

Lfr  " lim  En  >(wn  + * + WD  > 

{Tk}  k ir~  {Tk}  n an  an 

f“>  K 

- WF(dW)  + l (t.  + uD  ) (F(T  ) - F(T  .)]  (4.23) 

Jq  k«l  K K K K-X 


where  F(Tp)  ■ 0 and  F(TR)  * 1.  The  following  theorem  specifies  an 
integral  equation  whose  solution  is  F(W). 


Theorem  4.1 

The  limit,  F(W),  of  the  probability  distribution  of  the  waiting 

time  of  a message  in  a data  compression  scheme  using  K data  compressors 

with  distortion  levels  {D^},  and  using  a connected  stationary 

deterministic  policy  with  thresholds  (T^}  satisfies,  for  W 5^0. 

K -AflHt  -W> 

F(W)  - l [F(min(0,T.  ))  - F(T  .)]A  e d«  (4.24) 

k*l  max(Tk^1  W-tfc) 

where  F(TQ)  ■ 0,  F(TR)  “ 1,  min(w,  TR)  - w,  and  tfc  is  given  by 
(4.14).  Furthermore,  F(W)  is  the  unique  solution  to  (4.24)  subject 
to  the  constraint  that  F(W)  is  a continuous  and  piecewise  differentiable 

function  and  the 

lim  F(W)  - 1 . (4.25) 

\b~ 
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Proof 


Using  the  connected  policy  with  thresholds  (Tk>,  it  is  clear 
that  the  waiting  tine  of  the  n**1  message  is  given  by 


w ” max(0,  w - + t.  - x ) 
n * n-1  k n 


(4.26) 


where  k is  such  that  <_  and  { t^}  are  independent 

identically  distributed  exponential  random  variables  with  mean 


.-1 


A . Thus,  Pr(w  < W,  T < w - < T.  t ■ t)  is  given  by 
n k— l n-i  k n 


Pr<Ik-i<ViiTk>  > if  WT  i " 


Pr(V“-Tk-i<ViiTklVT)  ■\Pr<Tk-i<Vii“-tk+T)  - 


11  Tk-l4tk',^Ik+tk-T 

’ “ Vi+VT  ‘ *• 

(4.27) 


Averaging  over  t in  (4.27), 


Pr<»„iW.  Tk_l  <«„.,£  V 


J Pr<1k-1  * Vi  i Tk 


)A  e dT 


ix(Tk+tk-W,0) 


+ j Pr<Tk-l  < Vl  1 w ” \ * t)X  e"XT  dr 

•“(Vi+tk-w-°> 

] Pr^Tk-l  < Wn-1  - >in^Tk»  w “ \ ♦ T>)x  e~XT  dT. 

max(T.  .+t.-W,0) 

(4.28) 
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Lettlr.g  • ■ t - ♦ W#  (4.28)  becomes 

Pr<".  i "•  Vi < Vi  i V 

- [ Pr(T  | < w - <.  min(T.  ,W))X 

■“(Tk-1*W"tk)  k 1 n_1  k 


-X  (v+t. -W) 

e dw  . (4.29) 


Sunning  (4.29)  over  k,  Pr(w  w)  Is  given  by 


Pr  (w  < W) 
n — 


■ i r 

k-1  > 


Pr(Tk  l < »n  l £mln(Tk,w))X  e 


-X(w+tk-W) 


max(Tk_lfW-tk) 


dw  . 
(4.30) 


C 


0 


Since  the  limiting  probability  distribution. 


F(W)  - lim  Pr(w  < W) 
trx» 


(4.31) 


exists,  the  dominated  convergence  theorem  gives  (4.24). 

Nov  suppose  F*(W)  is  a continuous  piecewise  differentiable  func- 
tion which  satisfies  (4.24)  and  (4.25).  The  derivative  of  F*(W)  is 
computed  from  (4.24)  and  is  given  by 


dF*(W) 


dU 


X (F* (W)  - G*(W) ) 


(4.32) 


where 


G*(W)  - 


F*(W  - tk)  , if  Tk _1  < W - tk  < Tk,  k - 1,2 K 

0 , otherwise 


Since  F* (W)  is  a continuous  snd  piecewise  differentiable  function, 
F*(w)  can  be  expressed  as 


- j'  <?£?-')  <»*  + F*(0)  . 


F*(  W)  - I 

'o 


(4.33) 
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Clearly  from  (4.32)  and  (4.33),  If  F*(0)  >_  0,  then  F*(W)  is  a non- 

negative  nondecreaslrg  function;  and  If  F*(0)  < 0,  then  F*(W)  is  a 

negative  nonlncreaalng  function.  But  F*(W)  satisfies  (4.25),  and  so 

F*(0)  > 0 and  F*(W)  is  a valid  distribution  function.  Now  let  F*(W)  be 

the  initial  waiting  time  distribution  of  the  queueing  system  and  let  the 

waiting  time  distribution  of  the  nth  message,  F*(W),  be  given  by 

K f-  -X  (w+t, -W) 

Ww)  ' l-i  J (P*(«i»(e.ik))  - F;ttk.x>l»  * <*“' 


.«(Tk-l,W-tk) 


(4.34) 


But  from  a generalization  of  Lemsut  3.10 


lim  F*(W)  - F(W)  . 

a—  n 


(4.35) 


Therefore,  since  F*(W)  satisfies  (4.24),  it  is  clear  that  F*(W),  F(W) 


and  F (W)  are  all  identical, 
u 


Q.E.D 


For  the  case  of  more  than  two  data  compressors,  K > 2,  the 
solution  to  the  integral  equation  (4.24)  is  quite  complex.  For 
this  case,  an  alternative  expression  for  ^({D^})  is  discussed 
in  Section  4.4.  The  following  theorem  determines  F(W)  for  the 

case  K - 2. 


Theorem  4.2 

-Xtf 

Let  ■ e , 1 ■ 1,  2.  Then  the  solution  of  the  integral 
equation  (4.24)  for  K ■ 2 la  given  by 


I m 


AW 

c e G (W) 
C1 


, if  0 < W <_  T 


F(W)  - 


e (gt  u2  * [V  (W)  " Gt  (W)l  ♦ V2  G2 


“At-  -At- 

- GT[e  Gg(W  “ Tx  - t2)  - • 1 Ge(W  - Tj_  - t^] 


T1  Tl+tl  1 

+ [gg  (w)  - gga  i(W)]j 


* lf  T1  < W (4.36) 


for  the  case  Tj  + <_  t^  and  is  given  by 


AW  *1-,,- 
c e Gq  (W) 


, if  0<W£T1 


F(W)  - 


AWf  T1  T1  -1  Tl+t2  T1+t1 

e [G  A(W)  - G A(W)]  + G u A 1[Gf.1  2(W)  - G 1 1 <W)  ] 

1 l2  C2  C2  *2 

-At.  -At. 

- GT^[e  Ge(W  " Tt  - t2)  - e 1 - Tj  - t1>] 

t.  T.+t-  | 

+ IGgA(W)  - GgA  A(W)]  ♦ P2  g2  GtA  ^(w)) 

, if  T.  < W (4.37) 


for  the  case  t^  < The  constants  G^t  G^t  GT  and  c in 

(4.36)  and  (4.37)  are  given  by 


°i  - (1  - g!0<ti  + 


(4.38) 


G2  " + tx)  - G^(T)  (Mj  - M2)A_1)m21  , 


(4.39) 


GT  - G^(T) 


S V2  X~  l >(W  X(T1-K1)l  u2G2  ) 

Y*,  J + TT^Tt^T e 

XT  {—  — ] 

Gt  e (tj-t2)  *1  T,  TH, 

(F^Tt^T  - l0  Ki'-'r  *> 


• if  TJ  + t2  < t2 


AT,  At 


• *-e  1 . GTM2  X i *<Wi 

(1  - At2)  4 (1  - At2)  1*  " e J 


+ > g2  ><W  + GT«  (trt2) 

6 - x*2)  (l  - xt2) 


Tl+t, 

t-V-1) 

r1  *1  T1+t1 

♦ l tt  1 - L 1 X) 
r-0  r r 


.if  < Ti  + c2 


The  syebol  [x]  is  the  greatest  Integer  less  than  or  equal  to  x and 

Lr  is  given  by  (A.53)  in  the  Appendix.  The  functions  GT  (V), 

T *i 

Gg(V)  and  G#(W)  are  given  in  (A.16),  (A. 24),  and  (A.39),  respectively, 

in  the  Appendix. 


COO  - c"1  e“AW  P(W) 


r 


where  c Is  selected  such  that 


G(0)  - 1 . 


(4.43) 


So,  upon  substituting  of  (4.42)  Into  (4.24)  the  resulting  Integral 
equation  for  G(W)  is  given  by 


G(W)  - l y 


k-1 


r i 


G(min(»,Tk)  e 


X(min(0,T  -a) 


“x(Tk-i'“-tk) 


- G(T, 


X(Tk-l^l 

,-!>«  J 


da 


So,  simplifying  (4.44)  and  letting  T ■ T^, 


*1  I0  G<y)  * U2  L 


fy,  | G(y)  dy  + y,  G(y)  dy  + G(T)  (p  - y )X 

T 1 1 


-1 


if  W <_  min(t  , T + t-) 


|yx  | G(y)  dy  + p2  | G(y)  dy 


r 2 

-l 


+ G(t)  (yx  - P2)X 


if  tj  < W < T + t2 


(4.44) 


G(W) 


! [T  #00 

U1  J G(y)  dy  + p2  J G(y)  dy  + G(T)  ^ X_1  - e_X(W'T))  , 

0 W“"  t n 


If  T + t2  < W £ t1 


tT  #00 

y.  G(y)  dy  + y G(y)  dy  + G(T)  (y.  X-1  - e"X(W"T)), 

JW-t,  1 h -t„  1 


if  max(t1,  T + t2)  < W < T + tj 


y,  r G(y)  dy 

1 Jw-t„ 


, if  T + t1  < W 


(4.45) 


C 
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Now  let 


G,  - f G(y)  dy 
J0 

(4.46) 

C2-f  G(y)  dy 

(4.47) 

Gt  - G(T) 

(4.48) 

and 

Go- 

W1  G1  + P2  G2  + GT(W1  " y2)Al 

(4.49) 

Fron  (4.45) 

G(0)  - Gq  , 

(4.50) 

and  so  fro«  (4.43) 

S'1  * 

(4.51) 

Thua,  upon  substituting  (4.46)  - (4.51)  into  (4.45), 

the  expression 

for  G(W)  reduces  to 


, if  W <_  min^,  T + t2) 

, if  tL  < W _<  T + t2 
rW-t. 

- u2  | L G(y)  dy  » 

, if  T + t2  < W £ tx 

- Vj  f ^ G(y)  dy 
1 J0 

if  max(tlt  T + t2)  < W £ T + ^ 

, if  T + tx  < W 

(4.52) 

In  solving  (4.52)  for  G(W),  two  cases  need  to  be  considered. 

These  are  ^ _<  T + t?  and  ^ > T + t2»  For  the  first  case  of 
<_  T + t2#  the  solution  is  found  by  dividing  the  domain  of  G(W) 
into  regions  and  solving  for  G(W)  in  these  regions.  From  (4.52), 
in  the  region  0 <_  W <_  t2 

G(W)  - 1 , (4.53) 

and  in  the  region  tj  < W « T + t2,  G(W)  satisfies  the  integral  equation 

rW-t 

G(W)  - 1 - H.  1 G(y)  dy  . (4.54) 

1 J0 

Letting  G(W)  • 0 for  W < 0,  it  is  clear  that  (4.54)  is  in  the  form  of 
(A. 15)  for  0 < W T ♦ t2<  The  solution  to  (A.15)  ia  given  in  Leona 
A. 2 in  the  Appendix.  Hence,  G(W)  is  given  by 


fW-t 

- u.  G(y)  dy 

1 ■’0 

t r1  - 


G(W)  - 


|l  + Gt(p2  r1  - e-x(w-T)) 


fW-tj 

•U2jT 


G(y)  dy. 


T 

W-t, 


1 *i2G2  " “2  JT  G(y)  dy 
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(4.55) 


G(W)  - G"  (W) 
C1 


for  0 < W <_  T + t.  where  (W)  la  given  by  (A.16).  From  (4.52), 

C1 

in  the  region  T + t2  < W £ T + t^,  G(W)  satisfies 


G(W)  - 1 + Gt(w2X"1  - e"X(W_ 


t)  rw_ti  r^~^2 

;)  - p-  1 G(y)  dy  - p L G(y)  dy  • 

1 j0  1 '0 


Since  W - t.  < T + t„,  the  expression  for  G(W),  (4.55),  can  be  used 

fW-t. 


p-t 

evaluate  p.  G(y)  dy  to  get 

j0 

fW-ti  f^-tl  0 

p G(y)  dy  - p 1 g” 

J0  1 J0  1 


(y)  dy 


Using  (A. 15),  (4.57)  becomes 


fw-t i 0 

i 1 G(y)  dy  «•  - GJ.  (W)  + 1 

1 '0  C1 


and  upon  substitution  in  (4.56),  G(W)  satisfies 


- Gt(p2A_1  - e"X(W_T))  + G°  (W)  - p2  | 2 G(y)  dy  . (4.J 


Finally,  from  (4.52)  in  the  region  T + t^  < W,  G(W)  satisfies 


G(W)  - p2  G2  - p2 


G(y)  dy 


(4.60) 


Combining,  (4.54),  (4.59)  and  (4.60),  for  W > 0,  G(W)  satisfies 


C,(W)  - Gt(u2A'1  - e“X(W"T))  u(W  - T - t2)  + G°  (W)  u(W) 

+ {P2  G2  ' W”1  _ e"X(W_T))  - gJ  (W)}  U(W  - T - tx) 


’ Jo 


G(y)  dy)  u(W  - T - t2) 


(4.61) 


where 


u(W) 


II  W > 0 

0 W < 0 


(4.62) 


Clearly  for  W ^ T,  G(W)  satisfies  an  Integral  equation  of  the  form 
(A.l).  Using  the  solution  (A.5)  derived  in  the  Appendix,  G(W)  is 
given  for  the  case  t^  <_  T + t2  by 


G°  (W) 
C1 


if  0 < W < T 


G(W) 


Gt  V2X”1{Gt  2(W)  ■ Gt  tl(M)}  + W2  c2  Gt  tl(W) 

—Xtj 

- G (e  G (W  - T - t_)  - e 1 G (W  - T - t.)} 
x e i el 


T+t 

+ (cJ(W)  - cG  X(W)} 


if  T < W 


(4.63) 


where  G^  (W)  satisfies  (A. 15)  and  is  given  by  (A. 16),  gT(W)  satisfies 

Ci  G 

(A. 23)  and  is  given  by  (A. 24),  and  G (W)  satisfies  (A. 38)  and  is  given 

e 

by  (A. 39). 

For  the  second  case  of  > T + t^,  again  the  solution  is  found 
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by  dividing  the  domain  of  G(W)  into  regions  and  solving  for  G(W) 
in  these  regions.  From  (4.52),  in  the  region  0 < W < T + t 


G(W)  = 1 - G"  (W) 

where  G°^(W)  is  given  by  (A. 16).  From  (4.52),  in  the  region 
T + c2  < W < t1#  G(W)  satisfies 


G(W)  - 1 + G (u,X  1 - e_A(W"T))  -n  [ 2 G(y)  dy  (4.65) 


and  in  the  region  tj  < W <_  T + tj,  G(W)  satisfies 


1 + GT(W2A  1 - e'X(W‘T))  - p f 1 G(y)  dy  - p [ 2 G(y)  dy  . 

J0  z Jq 


(4.66) 


Now  for  W - tj  < T + t2,  the  expression  for  G(W),  (4.64),  can  be 

used  to  evaluate  p 1 G(y)  dy  to  get 

1 '0 

fw-tl  fW-t 

vl  G<y)  dy  - Pj  1 G°  (y)  dy  . 

‘*0  A Jq  ci 


(4.67) 


Using  (A.15)  in  (4.67),  the  expression  in  (4.67)  becomes 


rW-t 

Ji  1 G(y)  dy  - - 

J0  ti 


(W)  + 1 


(4.68) 


Upon  substitution  of  (4.68)  into  (4.66),  G(W)  in  the  region 


< w 1.  T + t^  satisfies 
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c 


G(W)  - Gt(u2X 


■1  . e-X<W-T>)  + G°  - , f't2  c(y)  dy  . 
C1  * }Q 


(4.69) 


Finally,  from  (4.52),  in  the  region  T + t^  < W,  G(W)  satisfies 


G(W)  - v2  G2 


fW-t, 

">\o  ‘ 


G(y)  dy  . 


(4.70) 


Combining  (4.64),  (4.65),  (4.69)  and  (4.70),  for  w >_  0,  G(W)  satisfies 
G(W)  - u(W)  + Gt(u2X_1  - e"X(W_T))  u(W  - T - t2)  + (G°  - 1)  u(W  - ty) 
+ {m2  g2  - gt(u2X"1  ■ e_X(W_T))  - G°  ) u(W  - T - t^ 


- lv. 


rw_t2 

•*0 


G(y)  dy]  u(W  - T - t2) 


(4.71) 


As  in  the  first  case,  for  T <_  W,  G(W)  clearly  satisfies  an  integral 
equation  of  the  form  (A.l).  Using  the  solution  (A. 5)  derived  in  the 
Appendix,  G(W)  is  given  for  the  case  t^  > T + t2  by 


G°  (W) 
C1 


if  0 < M < T 


G(W) 


T ^1  —1  ^^2  T+t. 

(G*  (W)  - G A(W) } + G v X A{G.  ^(W)  - G A(W) } 

C2  C2  1 Z C2  C2 

-Xt,  -Xt. 

- G (e  z G (W  - T - t,)  - e 1 G (W  - T - t.)) 
re  i.  el 

t T+t  T+t 

+ {GgA(W)  - Gg  A(W)>  + p2  G2  Gt  A(VJ)  , if  T < V . 

(4.72) 


In  order  to  find  F(W),  the  constants  G^,  G2,  and  G^  must  be 


c 
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evaluated.  From  (4.48),  (4.63),  and  (4.72)  GT  Is  given  by 

C_  - G°  (T)  . (4.73) 

1 

From  (4.46),  (4.63)  and  (4.72),  G^  is  given  by 


G 


1 


G(y)  dy 

(y)  dy  . 

C1 


(4.74) 


Using  (A.1S)  in  (4.74),  the  expression  for  G^  becomes 

G1  " (1  - Cj  (T  + t1))vj1  . 

Substituting  (4.73)  and  (4.75)  into  (4.47)  and  solving  for  G^, 
G2  is  expressed  by 


(4.75) 


C2  “ {Gt1(T  * tl)  " CT)  (^  - P2)A"1)p21  . (4.76) 

Finally  the  constant  c in  equation  (4.42)  must  be  evaluated. 

It  is  evaluated  using  the  constraint  lim  F(W)  - 1 given  in  (4.25). 

From  (4.42),  this  constraint  implies  that  c satisfies 

c-1  - lim  (eXW  G(W))  . (4.77) 

W*« 

Now  using  the  expressions  (4.63)  and  (4.72)  for  G(V)  in  (4.77)  and 
then  making  use  of  the  expressions  (A.45),  (A.52),  and  (A.62),  the 
expression  (4.41)  for  c_1  is  obtained. 
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Hence,  solving  (4.42)  for  F(W)  and  using  (4.41),  (4.63),  and 
(4.72),  the  expressions  (4.36)  and  (4.37)  for  F(W)  are  verified. 

Q.E.D 

Using  the  expressions  (4.36)  and  (4.37)  for  F(W),  the  limiting 
average  waiting  time  is  evaluated  In  the  following  theorem. 


Theorem  4.3 


The  mean. 


Wd  F(W)  , 


associated  with  F(W)  is  given  by 

T1 

It1! 


1 r1  E A (I ,-rt  ) -X(T  - rt  )u 

-c-T  I (I  e 1 1 ( L 1 - 

r-0  u-0  u! 

.ifAd+t)  T +t  Xt 

+ gt  V e (TiT7i  + 

L 1 xt2  2(1  - Xt2)' 

‘‘W  T,  * tt  >t22  1 

e ^i  - xt  + y)  I 

Xt2  2(1  - Xtx)Z  J 


T1  + tl  Xt2 

+ V2  G2^1  - Xt  + 2^ 

Xt2  2(1  - Xt2)Z 


+ G rX'i(HW  - X(Vt2)  , C1  - t2  qt2)2(tr  t2K 

T \ 2(1  - Xt2  1 - Xt2  + 2(1  - Xt2  / 


T +t 


, 1 T.  T.+t.  1 

+ F^xT  l <\  “hr  > 

1 At2  r-0  r r 


(4.  79) 


for  Tj  + < tj  and  Is  given  by 


E(W)  - c 


T1 


1 v / r A(Trrti>  (“  A(T1  - rti»U 
“ I Z < Z e A i 1 

r-0  u«0  u * 


1) 


+ S V 


-ri 


At, 


At  + 2> 

2 2(1  - At2)2 


Atl  *1 

- * ( 7-  + 


'1  - At 


2 2(1 


" A*2>  J 


+ P, 
T 2 


f A(vt2)  yf_t 
[ ll  - At 


At, 


1 - At_  + 


2 2(1  - At2)i 


At, 


2 2(1  - At 


A 


T1  + C1 

+ u2  °2<T^tr + 


At, 


2 2(1  - At2)' 


♦ .wV  - »<w2  v* . «'2>2  <h  - v\ 

1 ^ 2(1 -»t2)  + + — (1 . )t ) ) 

m t a.  ^ » 


T +t 

l-T1! 

1 t.  T,+t, 


1 - At 


2 r-0 


I a.,1  - h'1 

—A  1 ^ 


(4.80) 


fWr  ‘l  < T1  + C2  where  Gl»  G2»  Gt  **“*  c given  by  (4.38),  (4.39), 

T 

(4.40),  and  (4.41),  respectively  and  l^1  is  given  by  (A. 72)  in  the 

Appendix. 
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Proof 


The  mean  associated  with  F(W)  is  given  by 


f 

J0 


E(W)  « lim  | y dF(y) 


Let  u(x)  be  the  unit  step  function  defined  by 


u(W)  -< 


1 , if  W >_  0 

0 , if  W < 0 


Then,  from  (4.81)  E(W)  can  be  expressed  by 


E(W) 


rTi  rw 

■ y dF(y)  - T F(T  ) + lim  y d[u(y  - T.)  F(y) ] 

J0  1 1 W-~>  Jo  1 


Integrating  by  parts,  E(W)  is  expressed  by 


E(W) 


“ “ [ 1 F(y)  dy  + lim  [ y d(u(y  - T ) F(y)) 

Jo  1 


From  (4.36)  and  (4.37),  for  0 <_W  <_T,  F(W)  is  given  by 


F(W)  - c G?  (W)  eXW 
C1 

So,  using  the  expression  for  G°  (W),  (A. 16),  the  first  term  of  ( 

C1 

is  given  by 


c 
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(4.81) 

(4.82) 

(4.83) 

(4.84) 

(4.85) 
.84) 


) 


fT!  f*,  ' y^'l1  (-  My  - tt.))r  My-rt.) 

P(y)  dy  - c 1 l 1—  . * d, 

J0  J0  r-0  r 


■ s 'N'1'  < f .<(T‘'rti)  >(T>  - ».»“  „ 

A r-0  u-0  u!  " } * 


T T T T 

Now,  define  the  functions  H*  (x),  H 1 2(x)  and  H 1 2<x)  by 

*2  ^ © 

■!  (x)  - f y d(eAy  G*  (y)) 

C2  Jo  t2 

^1^2  [x  Xv  ^ o 

“G  (X)  " J0  y d(e ^ {GG  (y)  - GG2(y)» 


(4.86) 


(4.87) 


(4.88) 


and 


T1T2  fx  Xx  -X(T,-T  ) 

He  <*>"J  y d(eAx{Ge(x  - Tx)  - e 2 1 G#(x  - ty})  . (4. 


-Xd.-T,) 


89) 


So,  substituting  the  expression  for  F(W)  from  (4.36)  and  (4.37)  into 
(4.84)  and  using  (4.87),  (4.88)  and  (4.89),  the  expression  for  E(W) 
becoaes 

, 'V'l1  r MT -rt  ) <-MT  - «.)>“ 

E(W)  » c - y l (l  . 1 1 L-;-! „ 

r-0  u-0 

-1  Ti+t2  Tl+tl  Ti+ti 

+ G |i  » ilia  H 1 2(x)  - lia  R 1 X(x)J  + w,G,  lia  H 1 X(x) 

— 2 x~  f2  2 2 *2 


(4.90) 


XT1  Tl+t2»Tl+tl  Tl*Tl+tl 

- GT  e 1 lia  HeX  2 1 1 + lia  HQl  1 1(x) 
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'Ztaaat 


for  + t2  <_  tj  and  becomes 


E(W)  - c 


, r X(T  -tt.)  (-i(T.  - «,))“ 

-I  I ( l « 1 1 1...  1 - 1) 


r“0  u-0 


U! 


T t 

+ 11m  H X(x)  - 11m  H X(x)  + G 
x*~  C2  2 


P-X^lim  H 1 
Lxx»  c2 

Ti+t  "J  XT.  ^l^t2,^l^tl 

- 11m  H 1 V)  - G_.  e 1 11m  H 1 2 1 X( 

x^»  C2  J x*» 


(x) 


tl’^l+tl  "^l"*"^l 

+ 11m  H 1 1 A(x)  + p G2  11m  H 1 A(x) 

x^»  x+“  Z2 


(4.91) 


t ^1*^2 

for  t,  < T,  + t_.  The  limit  of  the  functions  H (x),  H (x)  and 

T.T,1  12  *2  C 

H z(x)  are  evaluated  In  the  Appendix  in  (A.67),  (A. 71),  and  (A.76). 
e 

Hence,  upon  substituting  |Jbese  limiting  values  Into  (4.90)  and 
(4.91),  the  results  of  the  theorem  are  obtained. 

Q.E.D 


So,  using  the  expressions  derived  for  F(W)  and  F(W)  dW  In 

2 0 
(4.23),  }({Dk>)  Is  evaluated.  Figure  4.1  sunnarizes  this  com- 

putation. In  Section  4.5,  these  techniques  for  the  evaluation  of 

2 1 
L(T  are  used  to  obtain  numerical  results  for  y2(D). 


4.4  Approximation  of  the  Delay  Distortion  Relationship 

In  the  previous  section,  the  Integral  equation  (4.24)  for  the 
limiting  probability  distribution,  F(W),  of  the  waiting  time  of  a 
message  In  a data  compression  scheme  using  two  data  compressors  (K-2) 


is  solved  explicitly  for  F(W)  . This  allows  to  *>e  evaluated. 


For  K greater  than  two  the  corresponding  integral  equation  (4.24) 

can  be  solved  by  the  usual  numerical  techniques  for  solving  integral 
K 1 

equations.  Then  ^({0^})  can  be  evaluated  and  can  be  computed 

K 

using  (4.7). 

In  this  section,  an  alternative  technique  for  evaluating 

K 

L{T  *8  Pre8ente(1  which  does  not  rely  on  numerical  techniques 

and  their  Inherent  convergence  difficulties  to  solve  for  F(W).  The 
technique  evaluates  L^T  j({D^})  by  considering  an  appropriate  imbedded 
Markov  chains  when  the  squences  (T^,  k - 1,  ...,  K-l)  and 
(t^,  k - 1,  2,  K)  have  a common  divisor  denoted  by  A (l.e., 

dividing  each  term  of  the  sequences  by  A yields  an  integer).  Then 
^(D)  is  evaluated  using  (4.7)  where  the  minimum  operation  is  taken 
over  a dense  set  of  values  of  (D^,  k - 1,  2,  ...»  K)  and  a dense  set 

of  values  of  (T^,  k - 1,  2 K-l)  (l.e.,  the  corresponding  set 

of  values  (t^,  k - 1,  ...,  K}  and  the  set  of  values 
(Tfc,  k-l,  ...,  K-l)  have  a common  divisor). 

In  order  to  define  the  Markov  chains  used  in  the  evaluation  of 
L{t  }({\})»  define  the  Integers  (Jl^,  k - 1,  2,  ...»  K)  and 
(m^,  m - 1,  2,  ...,  K-l)  by 


t 

k A 


k - 1,  2,  ....  K 


(4.92) 


. .h 

“k  A 


k-l,  2,  ....  K-l 


(4.93) 


Then  for  the  queueing  system  with  thresholds  {T^>  and  service  time 
{t^},  the  virtual  waiting  time  at  an  instant  nA  units  of  time  following 
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the  initiation  of  a busy  period,  is  an  integer  multiple  of  A.  So, 
define  the  sequence  of  imbedded  times  (t  },  n >_  1,  by 


£ + A 

n 


, if  £ is  not  the  end  of  a busy 
n . . 
period 


(4. 94) 


\max{t  : t.  > t ,j  > 0),  otherwise 

V bj  bj  - 

where  t,  • t.  , and  t.  is  the  time  at  which  the  1 n busy  period 

1 -b1  bj 

begins.  Thus,  if  V(t)  is  the  virtual  waiting  time  of  the  queueing 
system  with 


V(0)  - 0 


(4.95) 


then  the  sequence  {x^}  defined  by 


x„  - V(S  )/A 
n n 


n ■ 1,  2,  «.i 


(4.96) 


is  an  Imbedded  Harkov  chain. 


Clearly,  x can  be  written  as  a function  of  x . and  the  number 
n n-1 

of  messages  N arriving  In  the  Interval  (t  -,  t ].  Hence,  if 
n n-1  n 

x^_^  ■ 0,  then  is  the  beginning  of  a busy  period  and  so,  - 1 
and  xn  ■ When  xn_^  > 0,  the  virtual  waiting  V(?n)  is  given  by 


V(t  ) - V(£  .)  - A + S 

n n-1  n 


(4.97) 


where  Sq  is  the  sum  of  the  transmission  times  of  messages  arriving 

in  the  interval  (£  . , £ ].  To  compute  V(l  ) or  equivalently  x , 
n-i  n n n 

let  Z(N,1)  be  the  value  of  x when  x - - i > 0,  and  N messages  arrived 

n n— i 

in  the  interval  (£  ^,  tnl»  So.  using  the  normalised  thresholds  {m^}. 


Z(N,i)  can  be  computed  recursively  by 


Z(N,i)  - Z(N-l,i)  + *k  , if  mk_l  < Z(N-l,i)  - 1 < n^,  k - 1,  ....  K 


(4.98) 


where  Z(0,1)  ■ i-1.  Therefore  for  x , > 0, 

n-1 


x - Z(N  x ) 
n n n— l 


which  is  computed  using  (4.98). 

To  compute  the  transition  probability 


(4.99) 


Pj|l  ' Pr<xn  ' JIVl  ‘ « 


(4.100) 


first  note  that  Pr(NR  - Nlx^  > 0),  denoted  by  P(N),  is  given  by  a 
Poisson  distribution,  and  so 

vN 


Pt0,„  • “I  Vi p 0)  ■ *'M  • 


(4.101) 


Hence , 


‘Jli 


i - 0 


1 , if  j 

P(N)  , if  j - Z(N,i),  i > 0,  N - 0,  1,  2,  3, 
0 , otherwise 


(4.102) 


Now  if  Tr-1««»  and  Xtfc<  1.  then  it  can  be  shown  that  the  Imbedded  Markov 

chain  (x  ) is  aperiodic  and  positive  recurrent.  Thus,  lim  Pr(x  - j), 
“ _ n 

which  is  denoted  ir  , satisfies 


"l  ■ Jo  PJU  ’1  • 


(4.103) 
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- — 


and 


l "j  - 1 . 


J-o 


(4.104) 


To  relate  the  Markov  chain  {x^}  to  the  calculation  of  LV^  ^({D^}), 

kj 

a second  Markov  chain  {y  } needs  to  be  considered  where 

n 


y m (*  . , x ) n ■ 1,  2,  ...  . (4.105) 

n nri  n 


The  transition  probability  of  {y^}  Is  given  by 

P(J1.J2lil»i2)  " " <Jl’j2)|yn-l  " (11*12)) 

- Pr(xn  - iv  xn-1  - j2|xn_1  - 1,  xn_2  - 12) 


(4.106) 


Using  (4.100),  P . i.  . is  expressed  in  terms  of  P. i.  by 
^Jl»J2l 11»12/  3I1 

« U - i. 


“ioJl  11 


'2  1 
otherwise 


(4.107) 


Then  lia  Pr(y  - (j.,J,)),  which  is  denoted  w.  . , is  given  by 


?r(j1.32)  " PJX|J2  *J2  * 


(4.108) 


To  coapute  L._  . ( { D } ) , functionals  of  the  Markov  chain  {y  } are 
IT.  f k n 

now  defined.  Let  N* (yn)»  k ■ 1,  ...,  K,  be  the  number  of  aessages 

which  are  coapressed  by  compressor  k during  the  time  period  (t  , 

n 

t ...  1.  It  is  easy  to  show  that  N (y  ) as  defined  is  a function  of  x 
nr  i n n 

and  and  hence  Is  a function  of  yR.  So,  N^(y)  is  coaputed  as 
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follows: 


| 1 , if  and*  k ■ 1 

1 0 , otherwise 


(4.109) 


N (j-l,j)  - 0 


for  all  j > 0 and  for  k « 1,  2,  ...»  K 

(4.110) 


and  for  j2  > 0,  and  j 1 >_  j 2 “ 1 
' k 


» <Ji + vV  - 


>(VV*i  • lf  pj1|j2  * 0 and  Vl  * J2i\ 

• l£  PJ1lJ2  - 0 

, otherwise  . (4.111) 


0 


N (j1,J2) 


Then  let  N(y  ) be  the  total  number  of  arriving  messages  In  the  Interval 
n 


(t  .ft],  which  Is  given  by 
n-i  n 


N(y  ) “ l « (y  ) . 


(4.112) 


k-1 


Now  define  the  functional  W(y  ) to  be  the  sum  of  the  waiting 

n 


times  of  messages  arriving  between  (t  t ].  So,  W(y  ) can  be 

n-i  n n 


expressed  In  terms  of  (N*(y  )}  as  follows: 

n 


, if  J,  “ 0 


- { o 


, If  j2  > 0,  «nd  Jx  - j2  - 1 
K (Nk(J.,j,)  - 1)  Nk(j  ,J  ) 

*«!-J2)«2  ' *5)A  + ^ — V 


N 

♦ I 

k-2 


k 

I 


k *1 
Ki 1 


Vv  »k(J1.l2)vi‘ 


, otherwise 
(4.113) 


Using  the  Markov  chain  {y^},  the  following  theorem  shows  that 


W « lim  N-1  E{  l (w  )} 


P “ lim  N_  l Pr(T  < w < T ) 

* n-0  k 1 n ~ k 

can  be  expressed  in  terms  of  the  mean  of  the  functionals  of  the 

Markov  chain.  Hence,  L^T  ^({0^})  can  be  evaluated  from  (3.15). 


Theorem  4.4 

Consider  an  adaptive  data  compression  scheme,  which  uses  K data 
compressors,  where  the  greatest  common  divisor  of  the  transmission 
times  { t^}  and  thresholds  is  A.  Then  W and  P,  are  given  by 


l W(j  ,j  ) S 
jlJ2  Jl' 

l N(j1,j2)  * 

J1J2  1 ‘ 

l Nk(J  ,J  )S 

J1J2  3lJ; 

I N(J  ,j  W 

J1J2 


From  the  generalization  of  Leona  3.10,  it  is  clear  that 


W - lim  E(w  ) 
n 


If 


Pk  " llB  Pr(Tk-l  <wn-V  * k - 1,  ....  K. 


(4.119) 


Using  the  ergodlc  theorem  (Doob  [38]),  (4.118),  and  (4.119),  It  is 
clear  that 


lim  N_1  l w - W 
N-*°°  n-0  n 


w.p.l 


n-1 


where  I.  (•)  is  the  indicator  function  defined  by 


V»>  • 


1 , If  < w < Ik 


0 , otherwise 


Now  let  be  given  by 

i 

Mi  “ l N(V  • <*• 

n-1 

So,  from  the  definition  of  the  sequence  {tn>,  (4.95),  is  lower 
bounded  by  considering  the  situation  where  every  arrival  initiates 
a busy  period  and  so. 


Mi^I7TT 


Thus, 


l w<yJ  l « 


I Nk<yn>  I W 

i-l " n-l  k n 


k - 1 N . (4.126) 


Pros  the  bound  (4.125).  and  from  (4.120)  and  (4.121)  It  is  clear 
that  the  limit  of  (4.125)  and  (4.126)  are  given  by 


l W(y  ) 

“■anr 5 

i-MO  i 


v.p.l 


(4.127) 


i 


. as . 

i-»-  i 


w*p*l  | k ■ 1|  •••!  K#  (4*128) 


Nov  from  an  ergodlc  theorem  in  Chung  [39], 

i I 5n  W(j  ,J  ) 

I W(yj  , 4 J1J2  1 2 

n,  ari.  1 - hh 

i-*-  i l % 4 »<W 

JlJ2  3132 


w.p.l 


(4.129) 


k I ’'n  Nk(J  ,j  ) 

i Ay _)  J,J2  jiJ2 

11*  l _H LI- 

i-n»  n-l  i l n . H(JltJ2) 

Jr J2  3132 


w.p.l,  k — 1,2, ...,K  . 

(4.130) 


Hence,  substituting  (4.127)  and  (4.128)  into  (4.129)  and  (4.130), 


the  expressions  (4.116)  and  (4.117)  are  proved. 


Q.E.D 


Using  (4.116)  and  (4.117),  W and  {Pfc}  can  be  computed  once 


V:  . ' 7> 


i 


V"T" 


if.  . is  computed  or  once  w . / ir  Is  computed.  From  (A.  110), 
J i J •»  J jJ  2 ° 


« 1 2 

j /no  i8  deternlne<1  from  'fy.  J * °»  1*  ••• 

(4.102) 


Now  clearly  from 


Pj|i“° 


for  i and  j that  satisfies  j + 1 >_  i >.  0.  So,  is  given  by 

solving  (4.103)  in  a recursive  manner.  Thus, 


where  — ■ 1. 

*0 

Hence,  U and  can  be  computed  using  (4.116)  and  (4.117), 

£ 

respectively,  and  j({Dfc})  can  be  evaluated  using  (3.15). 

Figure  4.2  summarizes  this  computation.  In  Section  4.5,  these 

techniques  for  the  evaluation  of  j ({D^})  are  used  to  obtain  numerl- 

1 ^ 
cal  results  for  YR(D),  K > 2. 


K.  |Tk|  . |Ok|  .m 


INITIALIZATION 

If  (Dk)  ♦PfcJ  C'1,k-  1. ...  K 

GCD  ({«„[.  \\\) 
ya  , k - 1, ...  K,  (4.92) 
TkM  , k - 1, ...  K,  (4.931 


INCREMENT  J 
J-J  + 1 


COMPUTE 

PJ|i,l-0....J+1  , (4.102) 

. (4.131) 

W(J.0,I-O....J  ,(4.113) 

Nk  U.I),l-0,.,J.k-1.  ...K 
(A  100).  (4.111) 


FORM  PARTIAL  SUMS 

w 

- 

w^wU'OPj,,*, 

n“ 

- 

W 

■ 

k l 

W £ Nk 
k-1 

.(4.116) 

pk 

■ 

k k 

Nk/  £ Nh 
k-1  k 

, (4.117) 

"•Pk\ 

VVERGE. 


^Tk},{°kl»-W\|1‘«k^Ok)Pk.(3.1B) 


Fi«urt  4.2.  Evaluation  of  L^W  i ( jDk}  ),  K>2 


4.5  Nuaerlcal  Results 

To  illustrate  the  performance  Improvement  when  using  the  adaptive 
data  compression  scheme  as  compared  with  the  use  of  the  nonedaptive 
scheme,  a single  channel  communication  system  is  considered.  This 
comparison  is  performed  by  evaluating  the  delay  distortion  relation- 
ship for  the  different  schemes.  The  source  and  channel  model  to  be 
employed  are  described  in  Section  3.2.  The  message  arrival  rate  is 

and  the  channel  transmits  Information  bits  at  a rate  C[— ta] . 
sec.  sec. 

Each  message  consists  of  a collection  of  v letters  which  are  lndepen- 

2 

dent  normally  distributed  random  variables  with  variance  a*.  The 

data  compressors  used  in  the  compression  schemes  under  consideration 

satisfy  the  rate  distortion  relatlonhlp  r(D)  . Using  a per- 

mess 

letter  squared  distortion  criteria,  r(D)  is  given  by  (see,  Berger  [4]) 


r(D)  - v/2  in(oVD) 

The  protocol  information  parameter  p„[^^l  is  given  by 

K 10688 


pR  " tn  K 


where  K is  the  number  of  data  compressors  used  in  the  compression 
scheme. 

Using  the  models  described,  the  delay  distortion  relationships 

Y^(D)  for  the  nonadaptive  compression  scheme,  K-l,  and  for  the 

adaptive  compression  scheme,  K-2,  and  3,  are  evaluated  using  (3.10) 

and  (4.7),  respectively.  In  the  evaluation  of  y*(D),  K - 2,  3,  the 
2 

quantity  j^D^})  is  computed  using  the  technique  described  in 
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w 


Figure  4.1  and  the  quantity  Ljt  ^({D^})  la  computed  using  the  technique 
described  In  Figure  4.2.  The  optimization  required  in  the  evaluation 
of  y*(D)  1*  performed  numerically  using  a numerical  optimization 
algorithm.  Figure  4.3  presents  curves  for  y*(D),  K - 1,  2,  3. 

Figure  4.4  shows  the  relative  performance  of  the  nonadaptive  scheme 
versus  the  adaptive  scheme  by  presenting  the  curves  yJ(D)  - y*(D), 

K - 2,  3. 

From  the  figures  It  Is  evident  that  at  low  distortion  levels  the 
adaptive  schemes  yield  considerable  smaller  message  delays  than  those 
obtained  when  employing  nonadaptive  schemes.  At  high  distortion 
levels,  nonadaptive  schemes  are  slightly  superior  to  the  adaptive 
schemes,  due  to  the  required  transmission  of  protocol  Information 
for  the  adaptive  data  compression  schemes.  The  lower  envelope  of 


the  curves  In  Figure  4.3  represent  a restricted  version  of  ya(D) 
defined  In  (3.14)  where  only  schemes  with  one,  two,  or  three  data 
compressors  are  considered  In  the  calculation  of  y^(D) • 

Thus,  from  this  example,  it  is  concluded  that  the  adaptive  data 
compression  schemes  considered  here  serve  effectively  as  a device 
to  reduce  message  delays  when  low  distortion  levels  are  required. 

The  situation  of  low  distortion  levels  corresponds  to  heavy  traffic 
In  the  coMunlcation  network.  As  the  distortion  levels  Increase, 
message  delays  associated  with  the  corresponding  adaptive  compression 

schemes  are  slightly  degraded  when  compared  with  those  obtained  by 

* 

nonadaptive  schaaws.  This  Is  due  to  the  Increased  traffic  resulting 
from  the  transmission  of  protocol  Information  when  using  the  adaptive 

scheme. 
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Figure  4.1  and  the  quantity  ^({D^})  Is  computed  using  the  technique 
described  In  Figure  4.2.  The  optimization  required  In  the  evaluation 
of  y^(D)  performed  numerically  using  a numerical  optimization 
algorithm.  Figure  4.3  presents  curves  for  y^(D),  K - 1,  2,  3. 

Figure  4.4  shows  the  relative  performance  of  the  nonadaptlve  scheme 
versus  the  adaptive  scheme  by  presenting  the  curves  y*(D)  - y*  (D) , 

K - 2,  3. 

From  the  figures  It  Is  evident  that  at  low  distortion  levels  the 
adaptive  schemes  yield  considerable  smaller  message  delays  than  those 
obtained  when  employing  nonadaptlve  schemes.  At  high  distortion 
levels,  nonadaptlve  schemes  are  slightly  superior  to  the  adaptive 
schemes,  due  to  the  required  transmission  of  protocol  Information 
for  the  adaptive  data  compression  schemes.  The  lower  envelope  of 
the  curves  in  Figure  4.3  represent  a restricted  version  of  ya(D) 
defined  in  (3.14)  where  only  schemes  with  one,  two,  or  three  data 
compressors  are  considered  In  the  calculation  of  ya(D) . 

Thus,  from  this  example,  it  Is  concluded  that  the  adaptive  data 
compression  schemes  considered  here  serve  effectively  as  a device 
to  reduce  message  delays  when  low  distortion  levels  are  required. 

The  situation  of  low  distortion  levels  corresponds  to  heavy  traffic 


In  the  comunlcation  network.  As  the  distortion  levels  Increase, 


isage  delays  associated  with  the  corresponding  adaptive  compression 


are  slightly  degraded  when  compared  with  those  obtained  by 


nonadaptlve  schemes*  This  Is  due  to  the  Increased  traffic  resulting 
from  the  transmission  of  protocol  information  when  using  the  adaptive 


schema. 
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DISTORTION  D/a2 
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4.6  Conclusions 

The  delay  dlatortlon  relatlonahlp  for  a communication  system 
utilizing  a 8 Ingle  channel  and  an  adaptive  data  compression  scheme 
was  considered  In  this  chapter.  The  delay  distortion  relationship 
was  shown  to  be  specified  by  a minimization  of  a functional  over  a 
vector  space.  Basic  properties  of  this  relationship  were  presented. 
The  functional  to  be  minimized  was  then  examined,  and  an  Integral 
equation  whose  solution  is  related  to  this  functional  was  derived. 

For  the  case  of  an  adaptive  data  compression  scheme  which  uses  two 
data  compressors,  this  integral  equation  was  solved  and  the  functional 
evaluated.  For  the  case  of  compression  schemes  which  use  more  than 
two  data  compressors,  a technique  was  developed  to  evaluate  the 
functional  by  defining  an  appropriate  Markov  chain  and  functionals 
of  the  Markov  chain.  Finally,  examples  of  the  delay  distortion 
relationship  for  a communication  network  employing  a single 
communication  channel  were  presented.  From  the  examples,  it  was 
observed  that  an  adaptive  compression  scheme  yields  superior  per- 
formance to  that  of  a nonadaptive  scheme  at  low  distortion  levels. 


! 
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CHAPTER  V 

THE  DELAY  DISTORTION  RELATIONSHIP  FOR 
TANDEM  CHANNELS 


The  results  in  Chapters  III  and  IV  pertain  to  a communication 
network  which  uses  a single  conmnmicatlon  channel  for  data  trans- 
mission. In  this  chapter,  these  results  are  extended  to  communication 
networks  employing  tandem  channels  where  the  channel  capacities  are  all 
equal.  In  Section  5.1,  the  tandem  channel  network  is  discussed  and  the 
delay  distortion  relationship  for  this  network  is  determined.  The 
resulting  delay  distortion  relationship  is  shown  to  be  of  the  same  form 
as  the  delay  distortion  relationship  for  the  network  consisting  of  a 
single  channel.  In  Section  5.2,  some  of  the  properties  of  the  delay 
distortion  relationship  outlined  in  Section  5.1  are  presented. 

Finally  in  Section  5.3,  numerical  results  for  the  delay  distortion 
relationship  for  various  tandem  networks  are  presented. 

5.1  Delay  Distortion  Relationship  for  Tandem  Channels 

In  this  section,  the  delay  distortion  relationship  is  investi- 
gated for  a network  configuration  consisting  of  a tandem  channel, 
as  described  in  Section  3.1;  a source,  as  described  in  Section  3.1; 
and  an  adaptive  data  compression  scheme,  as  described  in  Section  3.2. 
Incorporating  the  results  of  Chapters  III  and  IV,  this  relationship 
is  shown  to  have  a form  similar  to  the  delay  distortion  relationship 
for  the  single  channel  network.  Thus,  the  results  of  this  section 
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will  closely  follow  the  results  of  the  previous  chapters. 

The  network  under  consideration  consists  of  M channels,  with 

Identical  channel  capacities  equal  to  C [nats/sec],  connected  in  a 

tandem  manner  as  depicted  in  Figure  3.1.  For  this  network,  and 

for  the  adaptive  data  compression  scheme  which  uses  two  data  com- 

M 

pressors,  the  delay  distortion  relationship,  Y2(°)»  is  given  by 

(3.13).  As  in  Section  3.2,  if  the  Infimum  in  (3.13)  can  be  replaced 

M 

with  the  minimum  operation,  the  expression  for  Y2(d)  reduces  to 
(3.16).  The  term  L^({Dk>)  in  (3.16)  is  expressed  using  (3.11), 


(3.12)  and  (3.15)  as 


N-l  n 


l/j({Dfc})  - lim  8 up  N_1  ][  Ew{  J lw„  „ + (*(d„)  + P?)c  *1  + 


n-0  n”l 


(5.1) 


where  n is  the  compression  assignment  policy,  {D^}  Is  the  average 

distortion  levels  of  the  data  compressors  used,  w is  the  waiting 

n,a 

time  of  the  n^  message  In  the  buffer  associated  with  the  m^  channel 

with  wA  - 0 for  m - 1,  2,  ...»  M (network  initially  empty),  d 
u,m  n 

is  the  average  distortion  associated  with  the  data  compressor  which 
compresses  the  nth  message,  u is  the  associated  Lagrange  multiplier, 
r(D)  is  a rate  distortion  relationship  that  the  data  compressors 
satisfy  and  p^  is  the  protocol  information.  Without  loss  of  generality 
assume  <_  D^,  for  the  remainder  of  this  chapter. 

To  determine  if  (3.16)  is  valid,  it  is  necessary  to  investigate 


L ({D.})  - inf  Lff({Dk)) 
sell 


(5.2) 


f 


and  ascertain  if  there  exists  a ir*  c II  which  satisfies 


L2({Dk»  - L2*({Dk»  (5.3) 

where  II  Is  the  class  of  causal  policies.  As  in  the  case  of  the 

single  communication  channel  in  Chapter  III,  the  verification  of 

(5.3)  is  accomplished  by  Introducing  the  notion  of  a Markov  decision 

process  for  the  tandem  conmunicatlon  channel,  and  then  determine 

w*  as  in  Section  3.4.  Once  (3.16)  is  verified,  optimization  tech- 

M 

niques  are  used  to  find  y2(D). 

The  Markov  decision  process  {((x  , a ),  n * 0,  1,  2,  ...}  which 

n n 

is  considered  has  a state  x described  by  three  elements.  The  first 

n 

element,  to  be  henceforth  denoted  as  w , is  the  waiting  time,  w ,, 

n n,l 

of  the  n1*1  message  in  the  buffer  associated  with  the  first  channel. 

The  second  element,  denoted  i , consists  of  the  maximum  of  I,  and  of 
the  stas  of  the  idle  periods  of  channel  one  since  the  last  transmission 
of  s message  compressed  by  data  compressor  one  where 

IA  - (M-l)  (tx  - t2)  (5.4) 

and  tk  Is  the  transmission  time  of  a message  compressed  by  data 
compressor  k which  is  given  by 

tk  - (£(Dk)  + p2)c"1  . (5.5) 

The  last  element  of  x , denoted  t , consists  of  the  lnterarrlval 

n n 

time  between  the  (n-l)sC  message  and  the  nth  message.  So, 

x • (w  , 1 , x ) , end  the  state  space  is  X - ([£,•)  x [0,1.)  x [0,»)). 
n n n n i 

The  action  at  the  nth  decision  epoch  is  a e [(1),(2)]  which 

n 


3 


) 


) 


3 
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corresponds  to  using  data  compressor  1 or  2 on  the  n message. 
Then  the  history,  denoted  H^,  of  the  Markov  decision  process  up  to 
the  arrival  of  the  nth  message,  is  given  by  the  sequence  of  states 
and  actions  as 


Hn  " (V  V V al*  V ®n)  * 


(5.6) 


The  history  pattern  Hn  may  not  be  the  complete  history  of  the  com- 
munication system.  A sufficient  condition  for  to  be  the  history 
of  the  communication  system  is  that  the  state  specifies  the  waiting 

times  w , m - 1,  . ..,  M of  the  n^  message  at  each  of  the  M channels. 
n,m 

The  following  lemma  proves  this. 


Lemma  5.1 

Let  the  initial  waiting  times  at  the  various  nodes  aatlafy 


V ■ V 

0,1  0 


(5.7) 


and  for  m - 2,  ...,  M satisfy 


■ 0 

0,o 


(5.8) 


(0,  tj  - t2  - max(0,  in  - (m-2)  (t,  - t,)> 


1 2 


(5.9) 


where  the  initial  state  of  the  Markov  decision  process  is 

Xq  « (Wq,  1^,  Tq).  Then  the  waiting  times  at  the  various  nodes  are 

given  for  n > 0 by 


) 


w 


n,l 


(5.10) 


and  for  m ■ 2,  . ...  M by 


!0  .if  an  - 1 

max[0,  ~ t2  - max(0.  ±n  - (m^Htj  - t2>)]  .if  aR  « 2 


(5.11) 


where 


4 


l 


.if  a ■ 1 
* n 


(5.12) 


mln[I, , i + max(0,  t - w , - t„)]  .if  a - 2 
I n * n n-1  2 n 


and  x * (v  , i , t ) is  the  state  of  the  Markov  decision  process, 
n n n n 


Proof 

The  proof  of  the  lenma  is  by  induction.  Suppose  (5.10)  and 

(5.11)  are  valid  for  the  n^*1  message.  By  the  recurrance  relationship 

for  waiting  times,  w ..  is  given  by 

trri.m 


nfl.m 


<°-  V—  * “ Ci> 


(5.13) 


where  t Is  the  lnterarrlval  time  between  the  arrival  of  the  (n-1) 
n 

message  and  the  nth  message  to  the  mth  channel. 

Firat  consider  the  case  a^(^  “ 1.  Clearly,  from  (5.12)  for 

m > 2 


st 


w + t < t, 
n.m  «n-  1 


(5.14) 


Independent  of  the  value  of  a,,.  Furthermore. 
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f 


r 


m 

V!  i 'l 


(5.15) 


since  the  transmission  time  of  a message  which  Is  compressed  by 
data  compressor  one  through  a single  channel  Is  t^.  So,  using 
(5.14)  and  (5.15)  In  (5.13) 


w “ 0 
n+l,m 


(5.16) 


for  m > 2. 


For  the  second  case  of  a ■ 2,  let  m > 2.  The  Interarrival 

n+1  * — 

time  r"  for  all  n > 0 Is  given  by 


t“  - i"-1  + t 
n n a 


(5.17) 


where  I Is  the  Idle  time  of  m channel  between  transmissions 
n 

of  the  (n-l)8t  message  and  the  nth  message.  But  I*+^  Is  given  by 


I*..  ■ max(0,  t®  - - t - w ) 
n+1  n+1  a n,m 

n * 


(5.18) 


Upon  substituting  (5.17)  Into  (5.18),  Is  given  by  the  recurrence 

relationship 


Xn+1  " max(0*  In+1  " wn,m  + t2  ~ 

So,  from  (5.17),  the  expression  for  becomes 


Vl  ' ■“<0-  W - - \ + t2)  + l2 


(5.20) 


Substituting  (5.20)  Into  (5.13)  results  In 


’ ■“<  "•  "...  + '.  ' '2  ' 
n 


W - "...  ' + *2»  • 

" (5.21) 


...  -V  • 


Mow  for  a ■ 1,  substituting  (5.11)  in  (5.21),  w , Is  given  by 
n nri,B 

’ “ax{0»  h “ C2  ” max[0»  Cl  _ (tl  " t2)I}  * (5‘22) 

But  from  (5.19)  for  a ■ 1 

n 

Ittfl  ” In+n  “ (■"D  (tj  ~ t2^ 1 (5.23) 


and  so,  upon  substitution  Into  (5.22), 


n+l,m 


{°,  tx  - t2  - aaxfl^  - (a-2)  (tx  - t2)]}  (5.24) 


For  a ■ 2,  substituting  (5.11)  and  (5.17)  into  (5.25),  w Is 
n n-nfm 

given  by 


nfl.a 


(0,  tj  - t,  - 


[0,  In  - W)  (t,  - «2)1  - £}) 


(5.25) 


where  I ^ is  given  by  the  solution  of  (5.19)  as 

Ci  ■ ■“(0-  Ci -Jm2  "»,.•>  • 


(5.26) 


Now  consider  three  cases.  First  let  ■ be  such  that  w ■ 0.  Hence, 

n.a 


w , - 0 for  n > a*  > 2 and  so 
n,a  ~ ~ 


n+l,a 


(0,  t,  - t,  - iax[0,  in  ♦ 1^  - (a-2)  (t,  - t,)]}  . 


(5.27) 


Then  let  » > 2 bt  the  saallest  integer  such  that  w * > 0.  So, 

n,a 

1 - (a~2)  (t.  - t.)  > 0,  thus,  w » satisfies  (5.27)  since 
n l / n*ri,a 

I^j  “ I^.  Finally  let  a > a.  Since  for  all  a'  > a , 

*n»a*  " *1  ’ *2*  Cl  U *lv<n  b* 
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(5.28) 


N 


( 


i ( 


c 


r 


m— 1 


Vi  ■ '“,l 


,<0-  Cl  • "n.S  ,L.  (tl  • t2,)  • 

* nr*m+l 

Substituting  the  values  of  w a into  (5.28),  I®,  is  given  by 

n,m  n+l 

Cl  • -“,0-  Cl  - [tl  - C2  - *n  + <S"2>  (tl  • t2)1  - ,L.,(tl  " '2s 


- max{0,  in  + IR+1  - (m-2)  (tj  - tj)} 


(5.29) 


Hence  substituting  (5.29)  into  (5.25)  and  noting  that  i - (m-2)  • 

n 

(t^  - t£>  £ 0,  expression  (5.27)  results.  Therefore  for  all  m >_  2, 

expression  (5.27)  is  valid.  Now  observe  that  in  (5.27)  i + I*  . 

n n+l 

can  be  replaced  by  max(I, , i + I*  ,)  without  affecting  the  value 

1 n n+i 

of  w . Clearly,  the  element  i . , of  x . , is  given  by 

nrl9ll  ltt\L  IVrl 


n+l 


Hence,  (5.11)  is  valid  for  n+l.  Therefore,  by  induction  since  w 


max(I]L, 

xifi> 

if  a - 1 

n 

max(Ilt 

i + I1..) 
n n+l 

if  a - 2 . 

n 

(5.30) 

0,m 


specified  by  (5.7)  - (5.9)  satisfies  (5.11),  then  (5.11)  is  valid 
for  all  n. 

Q.E.D 


Leona  5.1  shows  that  history  H^  is  the  coioplete  history  of  the 

communication  system  for  all  initial  conditions  of  the  form  (5.8) 

or  (5.9).  Certalntly,  the  initial  condition  requiring  all  the 

buffers  in  the  communication  system  to  be  empty,  which  is  assumed  in 
M 

calculating  Yj^),  satisfies  (5.8)  when  ip*^.  ‘Thus,  initial  conditions 
other  than  that  of  the  form  of  5.8)  or  (5.9)  need  not  be  considered. 
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Using  the  recursive  equation  (3.21)  for  w , and  the  recursive 

n 

equation  (5.12)  for  a version  of  the  transition  probability 
distribution,  Prfr^  1 W,  1^  <_  I,  < T|Hn)  Is  given  by 


Pr(wn+1  1 W*  £ I.  1 TIH„) 


n+1 


n+1 


’ Pr("n+1  i “•  Vl  i '•  Vl  i T|  <V  V V'  %' 

f 

P (min(I+w  +t  ,T))  - P (nax(0  ,w +t  -W))  , If  a -1,  I<I_ 

t ni  t na^  ' n — 1 

Jpt(T)  - PT(max(0  ,vn+ta^-W) ) , if  an-l,  I>11 

|PT(«ln(I-in+wn+t2,T))  - PT(»ax(0  ,wn+t2-W)),  if  an«2, 

Pt(T)  - PT(«ax(0  ,wn+t2~W) ) , if  an-2,  I>I 


1 * 
(5.31) 


The  rules  or  policies  ir,  associated  with  the  Markov  decision 
process,  are  specified  in  Section  3.3,  and  the  set  II  Is  a collection 
of  such  policies.  The  cost  function  c(x,a)  is  of  the  form 

!w  + Mt.  + yD.  , if  a * 1 

1 1 (5.3 

w + t2  + (M-l)t1  - 1 + pD2  , if  a - 2 


where  x - (v,1,t)  e X,  and  a e [ (1) , (2) ] . Hence  the  average  cost 

•(*»*)  under  policy  ir  c II  when  the  Markov  decision  process  (x  , a ) 

n n 

la  Initially  in  state  x e X is  given  by  (3.39)  to  be 

-1  N_1 

*(x,w)  - 11a  sup  N l Ea(c(x  , a ^_)|*n  - x)  (5.33) 

!+♦«  n-0  n n u 

and 
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) 


) 


I 


3 


3 


(Svitr- 


* 


r 


| ( 


C 


c 


0 


#(x)  « inf  $(x,ir)  . (5.3A) 

well 

Relating  this  to  the  communication  system  under  consideration, 
it  is  clear  from  Lemma  3.1  and  (5.32)  that 
M 

l w + Mt  + uD  - c(x  , a ) . (5.35) 

, n.m  a a n n 

n»l  n n 

Thus,  by  properly  initializing  the  Markov  decision  process  (x^,  a^), 

2 

Lij((D^})  is  clearly  given  by 

Ln((Dk})  " *<(0‘  h*  0)»  n)  * (5*36) 

Since  H is  the  complete  history  of  the  communication  system,  the  set 
n 

of  admissible  policies  II  is  the  same  set  of  policies  considered  in 
(5.2).  Hence,  L*({Dk))  is  given  by 

L2({DkJ)  - *((0,  I,  0))  . (5.37) 

Therefore,  the  problem  of  finding  a policy  ft*  which  satisfies  (5.3) 
reduces  to  the  problem  of  locating  a policy  n*  which  satisfies 

#(x)  - ♦(x,x*)  . (5.38) 

Before  determining  a it*  which  satisfies  (5.38),  the  value  of 
Dj  must  be  restricted  to  guarantee  that  t(x)  is  finite.  So,  as  in  the 
single  channel  network  case  in  Section  3.4,  is  restricted  to  the 
region 


d2  < r’hf  - p2) 


Outside  this  region  4(x)  is  unbounded. 


159 


To  find  it*  and  $(x),  first  consider  the  Markov  decision  processes 
{(*n»  *n)»  n ■ 0,  1,  2,  ...}  defined  as  before,  but  with  cost  function 
c(x,a),  given  by  (5.32),  replaced  by 

w + Mtx  + pDx  ,if  a - 1 

c'(x,a)  ■ w + t2  + (M-l)t^  + pD^.if  a ■ 2 and  1 • 0 

w + Mt2  + yD2  ,if  a - 2 and  1 > 0 

(5.39) 

where  x ■ (w,  1,  t).  The  associated  average  cost  is  denoted  ♦,(x,ir) 
and  *’(x)  is  given  by 


> 


"3 


J 


3 


) 


3 


I 


3 


3 


1 


t 


r. 


c 


c 


s 


f 


t 


t 


c 
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if  such  a policy  exists.  In  order  to  find  n'  c n that  satisfies 
(5.44),  Theorem  3.2  is  to  be  used.  The  following  set  of  lesaas  verify 
Assumptions  3.1,  3.2  and  3.3  which  are  required  to  apply  Theorem  3.2. 


Lemma  5.2 

Assumption  3.1  is  valid  for  the  Markov  decision  process  (x  , a ) 

n n 

under  consideration  with  costs  c'(x,a)  given  by  (5.39). 


Proof 

Let  m ■ 1, 


g((w,  i,  t))  - w + 1 (5.45) 

and 

b - max  t.  . (5. 46) 

ke[l,2]  k 

Then  (3.42)  and  (3.43)  are  easily  verified.  Hence  the  assumption  is 
valid. 

Q.E.D 


From  Theorem  3.1,  there  exists  a-optimal  stationary  deterministic 
policies  for  a c (0,1)  which  can  be  obtained  using  the  policy  improve- 
ment algorithm  of  Corollary  3.1.  The  following  lemma  establishes 
the  structure  of  the  a-optlmal  policy. 


1(1 


5.3 


The  a-optlaal  policy  for  the  Harkov  decision  process  (x  , a ) 

n n 

under  consideration  with  cost  function  c'(x,a)  given  by  (5.39) 
is  a connected  policy  where  action  one  is  selected  when  the  waiting 
time,  w,  and  the  truncated  sun  of  the  Idle  times,  1,  satisfy 


w < T (1) 
— a 


(5.47) 


and  action  two  Is  selected  otherwise.  TQ(1)  Is  termed  the  threshold 
function,  and  is  selected  to  minimize  the  a-discounted  cost  problem. 
Furthermore,  Tq(1)  Is  a constant  for  all  1 e (0,1^)  and  1^(0)  1^(1^) . 


Proof 

First,  as  In  Lemma  3.4,  one  can  show  that  the  dependences  on  the 
Interarrival  time  can  be  dropped.  So,  using  the  value  of  c' ((w,1,t) ,k), 
(5.39)  and  the  expression  for  pr(wn+1lw»  in+1<I,  Tn+1lTUwn»1n»Tn) »k> » 
(5.31),  the  policy  Improvement  algorithm  (3.45)  becomes  for 
(w,  1,  t)  c X 


°n+l  a**’1*  " *ln  (w  + 

”*1,a  ac[ (1) ,(2)] 


+ « r*  + ■ *•  "t(s> 

* “ 13k.  <-«!•  «.)  «-,<;>) 


(5.48) 


where 


:> 
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C 


C 


t 


c 


o 


- o , 

(5.49) 

I 

g(i»®)  “ ! 

ta  + wDa  + (M-l)tl 

9 

if  1 - 0 

1 

Mt  + pD 
a a 

9 

If  1 > 0 

(5.50) 

and 

(0  , if 

a - 1 

‘•’I.  , « 

a - 2 

(5.51) 

Now  define  0 (w,i)  by 

n,a'  * 7 7 

A fw 

r 

U (w,i)  - U 

n,a  JQ  n,a 

(w  - w,i)  dF  (w)  + U 

t In 

(0,min(I  ,i+w-w)) 
* 

dFr(w)  . 

(5.52) 

Thus,  (5.48)  becomes 

U “ ®ln  {w  + St1*®)  + a U „(w  + t » 16  )} 

It+1»°  ae[  (1) , (2)  ] n*a  a a 

(5.53) 

To  establish  the  limiting  form  of  U (w,l)  as  n -*•  ®,  it  Is 

n,a 

required  to  show  that  for  all  n and  for  all  o c (0,1),  u (w,i)  is  a 

n,a 

nondecreasing  function  of  w for  flxad  1,  and  is  a nonincreasing 

function  of  1 for  fixed  w.  First  define  F to  be  the  set  of  functions 

of  two  variables  of  the  form  f(w,i)  where  (w,i)  e ([0,®)  x [0,^]) 

such  that  f(w,i)  is  a nondecreasing  function  of  w for  1 fixed,  and 

f(w,l)  Is  a non increasing  function  of  1 for  w fixed.  Clearly, 

un  „(*#*)  E Now  suppose  U (•,•)  e F and  then  from  (5.52) 

wjO  n,a 
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0^  o(»,»)  e Thus,  from  (5.53)  Un+j  a(*,*)  e ^ f°r  ®d  n 1 ® and 
for  all  a e (0,1). 

Nov  to  show  that  the  a-optlmal  policy  Is  a connected  policy, 
define  for  w < 0 


U (w,i)  - U (0,  min(I, , 1 - w)) 


n,a 


n,a 


(5.54) 


and 


0 (w,i)  - 0 (0,  mind,,  1 - w) 

n,a  n,a  i 

Furthermore,  define  for  (w,l)  e d-",00)  x [0,1^]) 


(5.55) 


AU  (w,i)  - U (w  + t.,0)  - U (w  + t,,i) 
n,a  n9a  i tifOi  * 


(5.56) 


and 


A0  (v,i)  ■ 0 (w  + t.,0)  - U (v  + t.,1)  . (5.57) 

n9a  n,a  l n,a  l 

Clearly  from  (5.52),  (5.56)  and  (5.57),  HU  (w,i)  la  given  by 

n,a 


aO  (w,i)  - f AU  (w  - w,i)  F (dw)  . 
n,a  n,o  t 


(5.58) 


Thus,  from  (5.53),  for  the  (n+1)  stage  problem,  the  policy  at  the 
first  stage  Is  to  select  data  compressor  1 if  w > 0 and  1 e [0,1^] 
satisfies 


g(i,l)  - g(l,2)  + a AO  (w, i>  < 0 

II 


(5.59) 


and  use  data  compressor  2 otherwise.  So,  the  optimal  policy  at  the 
first  stage  Is  a connected  policy  if  AO  (w,i)  e F'  where  F'  Is  the 

Q|(l 
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set  of  functions  f(w,l)  defined  on  ((-«,<»)  x [0,1,])  which  are  non- 
decreasing functions  of  w and  1.  Now  define  F"  to  be  a subset  of 
F'  such  that  If  f(»,*)  e F"  then  for  a fixed  value  of  w e (-*,“) 

f(w,l)  Is  a constant  for  1 e (0,1,].  Hence,  if  AO  (•»•)  e F", 

i n,a 

(5.59)  Implies  that  the  optimal  policy  Is  a connected  policy.  To  show 
that  AU  (•,•)  e F"  for  all  n > 0 and  all  a e (0,1),  first  observe 

Il|Q  “ 

that  AUn  „(*»*)  e F"  and,  so  by  (5.58)  AU_  (*,•)  e F".  Now  suppose 

v jU  U f 01 

AU  (*,•)  c F",  and  hence  aO  (•,*)  e F".  This  Implies  that  at  the 
n,a  n,a 

first  stage  of  the  (n+1)  stage  problem,  the  optimal  policy  for  a dis- 
count factor  of  a c (0,1)  Is  a connected  policy  with  threshold  function 
Tn+1  o(1)  *PPlle<*  to  th*  waiting  time  w.  From  (5.59)  it  is  clear  that 

Vl,3(1)  iB  a constant  ^ (0,1,]  and  Tn+1  a(0)  i T^^d,). 

Using  (5.53)  in  (5.56),  AU^,  a(w,i)  for  (w,i)  e ((-•,«•)  x [0,1,]) 
is  given  by 


aUn+l,a(wftl'1)  - “Vl.a^’V  • lf  w " “fcl 


t -t  + aAU  (w+t-,0) 
1 Z n,a  X 


, If  w+t.  < T(0), 

A V ^ L*.  ^ m , 


0 <*w+t2  <_  T(i) 


2(trt2)  + p(D,-D2)  + I,u(-i)  + oAOna(w+t,,0) 


+ A0n  (w+t,»l) 
n,a  Z 


fu(D2  - D,) 


, if  wft  < T(0), 
w+t*  > T(l) 

, if  w+t  > T(0), 

0 <Aw+t2  < T(i) 


.(t.-O  + I.u(-i)  + aAU  (w+t- , 1)  , if  w+t.  > T(0) 

i 1 2 1 n,a  2 ^1  > T(1)» 


(5.60) 


» 


where  u(»)  is  the  unit  step  function  defined  by 


!0  , If  i < 0 

. (5.61) 

1 , if  1 >_  0 

Since  a(w,0)  is  an  increasing  function  of  w,  40^  a(*»*)  G F", 
and  Vi,  a(w,i)  is  a continuous  function  of  w,  then  it  is  clear  from 
(5.60)  that  AUn+^  a(*,*)  E Hence  by  induction,  AU^  a('**)  c F" 

for  all  n ^ 0. 

Now  from  (3.46)  the  a-discounted  cost  using  the  a -optimal  policy 
irj  is  given  for  (w,i)  e ([0,»)  x [0,^])  by 

Va((v,i),ir*)  - lim  U (w,i)  (5.62) 

n^<»  * 

where  the  dependences  on  the  interarrival  time  t have  been  dropped. 

Then  AVo((w,i),w*),  defined  by 

AVa((w,i),r*)  - Va((w  + tlt0),ir*)  - Va((w  + t2,i),n$) 

- lim  AO  (w,i)  , (5.63) 

* 

1*  a limit  of  functions  which  are  elements  of  F".  Hence, 

AVo((*, •),»*)  e F".  Furthermore,  from  (3.44)  and  (5.39)  ir*  is  he 
policy  which  selects  data  compressor  1 if  (w,i)  satisfies 

tl~  t2*  “^l  “ °2)  + 1iu<“i>  + 0 f AVa<w  “ fr»i)FT(dw)  < 0 

(5.64) 

and  selects  data  compressor  2 otherwise.  Since  AVa((», •),»*)  e F", 
expression  (5.64)  Implies  that  **  is  a connected  policy  with  a 


I 


threshold  to  be  labelled  as  T^d)  where  data  compressor  1 is  used  if 
(w,i)  satisfy  w <_  T^d)  and  data  compressor  2 is  used  otherwise.  In 
addition,  Ta(i)  is  a constant  for  i e (0,1^]  and  Ta(0)  21^(1^). 

Q.E.D 

Thus,  the  a-optimal  policy  is  a connected  policy  specified  by  a 
threshold  function  Ta(i).  The  following  lemma  proves  that  Ta(i)  is 
upper  bounded  for  all  a e (a*,l)  where  a*  e (0,1). 

Lemma  5.4 

Ta(i),  which  is  specified  in  Lemma  5.3,  is  bounded  for  all 
a e (a*,l)  for  some  a*  e (0,1). 

Proof 

The  proof  follows  from  a generalization  of  the  proof  of 
Lemma  3.6. 

Q.E.D 

Assumption  3.2  is  now  verified  in  the  following  lemma. 

Lema  5.5 

Assumption  3.2  is  valid  for  the  Markov  decision  process  (x  ,a  ) 

n n 

under  consideration  with  costs  c'(x,a)  given  by  (5.41). 

Proof 

In  this  proof,  the  Interarrival  time  is  disregarded  in  the 


n 
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description  of  the  state,  since  by  Lemma  5.3  Va(x,ir*)  is  Independent 

of  the  interarrlval  time.  Now  consider  the  following  two  cases; 

T (I,)  > 0 and  T (I.)  “ 0”.  For  the  first  case  of  T (I,)  > 0,  let 
ol  — a l a l — 

and  be  the  disjoint  regions  defined  by 

Rx  - {(w,i)  e ([0,-)  x [0])  u([0]  x 10,1^)}  (5.65) 

and 

R2  - ((w.i)  e ((0,-)  x (0,1^)}  . (5.66) 

Clearly,  if  (w  ,i  ) e R, , then  (w  ,i  .,)  e R . This  Implies  that 
n n l n+l  n+1  l 

if  (Wq*1q)  e R^»  then  for  all  n >_  0,  (w^.i^)  E Rj  and 

c'((w  ,i  ),a  ) - c'((w  ,0),a  ) . (5.67) 

n n n n n 


Furthermore,  if  (wn,i  ) e R,  the  sequence  {a  } is  unaltered  if  the 
U u 1 n 

policy  wj  with  threshold  function  Ta(i)  is  replaced  with  the  policy 
with  threshold  Tq(0)  for  all  i e [0,1^].  Hence,  the  a-optimal 
discounted  cost  Vo((w,i) ,n*)  for  (w,i)  e R^  is  given  by 

N 

V((w,i),x*)  - 11m  sup  { l a"  c* ((wn>0) ,an) |wQ  - w,  iQ  - i } 

N >0°  a n-0 


(5.68) 


Substituting  the  value  of  c'((w,0),a)  from  (5.39)  into  (5.68), 
Va((w,i),wJ)  satisfies 


V0«w,i),w*) 


_ 12=11  t 
1-a  i 


lim  sup  E { l an(w  +t  +pD  )|w  -w,  i -i). 
*o  n-0  n ®n  an  0 0 

(5.69) 


i 


1 
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Comparing  the  right  side  of  (5.69)  to  the  a-optimal  discounted  cost 
for  a single  channel  in  Section  3.4,  it  is  observed  that  the  right  side 
of  (5.69)  is  equal  to  the  a-optimal  discounted  cost  for  the  single 
channel.  Now  from  Lemma  3.9  there  exists  an  a*  e (0,1)  such  that 
for  all  a e (a*,l)  and  for  all  (w,i)  e R^,  a function  L^(w)  exists 
that  satisfies 

|Va((w,i),n*)  - Va((0,I1),1r*)|  < L^w)  . (5.70) 

Now  from  the  proof  of  Lemma  5.3,  Va((w,i)  ,tt*)  is  shown  to  be  a non- 
increasing  function  of  i e [0,1^]  and  a nondecreasing  function  of 
w e [0,®].  Thus,  (5.70)  Implies  that  for  all  (w,i)  e ([0,®)  x [0,1^]) 

|v0((w,i),TrJ)  - va((0,Ii),w*)|  < Li(w)  . (5.71) 

Now  consider  the  second  case  of  Ta(I^)  = 0~.  For  this  case  let 
Rj  and  R^  be  the  regions  defined  by 

R3  - (t0,«)  x (0,1^)  (5.72) 

and 

R4  - (t0,»)  x [0])  . (5.73) 

So,  if  (wn,in)  e R3,  then  (wn+1»in+1>  e R3*  This  implies  that  if 

(w  ,i  ) c R, , then,  for  all  n > 0,  (w  ,i  ) e R_,  a - 2,  and 
u u j — nnjn 

C,(<Wn,1n),an)  " c’  »2)  * (5.74) 

So,  for  (wn»in)  e r3  the  a-optimal  discounted  cost  Va((w,i),ir*)  is 
given  by 


I 


N 

V «wti),*J)  - 11m  sup  E { £ aD  c’((v  .I.),2)|w0  - w,  1 - I } 

N-k»  a n-0  n u 

N 

- 11«8UP  E^{  I o“(wn  + t2  + uDa)|w0  - v,  iQ  - Ij 
a n-0 

5 

+ (M-l)  t2/(l-a)  (5.75) 

where  the  sequence  of  waiting  times  (w^}  represents  the  sequence  of 

waiting  times  of  M/D/1  queueing  system  with  serve  duration  of  t2  and  | 

mean  interarrival  time  of  X-1.  Now  to  find  a bound  for 

Vo((w,l)  ,it*)  - V^O.I^)  ,ir*) , (w,i)  e ([0,“)  x [0,1^),  consider  the 

policy  iro  which  always  uses  data  compressor  2.  For  (w,i)  e the  r 

a-dlscounted  cost  is  the  same  under  policy  ir^  or  ir£  as  shown  In  (5.75). 

For  (w,i)  e the  a-discounted  cost  is  higher  under  policy  v than 
under  policy  ir£,  since  ir*  Is  the  a-optlmal  policy.  Thus. 

Va((w,i) ,7r*)  ” ^((O.Ii) tx*)  is  upper  bounded  by 

AVa(w,i)  - Vo((w,i),ira)  - Va((0.11)fita) 

> Va((w,i),ir*)  - Va((0,I1),ir*)  > 0 . (5.76) 

To  compute  AV  (w.i),  consider  the  sequences  of  waiting  times  out  of  a 

® » 

f 

M/D/1  queueing  system  given  by 

*2!  ’ “(0-  *“  * *2  - Vl>  1-1.2(5.77) 

1 2 

•here  w^  - 0,  w^  - w and  {tq}  la  a sequence  of  independent  and  ldenti-  . 
cally  distributed  exponential  random  variables  with  mean  X-1.  Thus, 

(5.77)  w > - wf>  0.  Furthermore,  if  w£^  - 0,  then 
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j ttiii— i aaiiin  1 1 


r 


I 


r t 


i o 


w(2)  - w^  for  n > m.  So,  using  these  relations  for  wf*^  and  the 
n n — » 

expression  for  c’((w,i),0),  AV  (w,l)  Is  bounded  by 


N -1 
w 


AV 


IB,  l - o'”  + (M-l)  <tx  - t2)) 

a n-0 

< {w  + (M-l)  (t  - t2)}  (Nw) 

a 


(5.78) 


where 


! < 


N - lnf{n:  w 

w n 


(2) 


0} 


(5.79) 


Setting  2 - 0 in  (3.152),  E (N  ) is  given  by 

IT  W 

a 


E (N  ) - w + X 

TT  W 

a 


-1 


(5.80) 


Hence  using  (5.76),  (5.78)  and  (5.80),  for  (w,i)  e ([0,»)  x [0,1^]) 


.-1, 


Va((v,i),x*)  - Va((0,I1),x*)  < [w  + (M-l)  (tx  - t2)]  (w  + X ) . 

(5.81) 


t 


r 

I 


Therefore,  by  (5.71)  and  (5.81),  for  all  a c (a*,l), 

| Va((w,l),T*)  - V ao.lj.w*)!  is  bounded  by  the  maximum  of  L^(w)  and 
the  right  side  of  (5.81). 

Q.E.D 


As sumption  3.3  Is  now  verified  in  the  following  1 


5.6 


For  the  Markov  decision  process  (x_,a  ) under  consideration  with 

n n 

cost  function  c'(x,a)  given  by  (5,39),  there  exists  an  increasing 
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sequence  {a  },  a e (0,1),  such  that  ail,  either  T*  (I,)  - 0~  or 
n n n ot  1 

n 

T*  (I.)  >0  for  all  n > 0 and  11a  T exists  where 
“n  1 “ n 


T - 

n 


, if  T*  (I. ) - 0"  for  all  a > 0 
a 1 

a 

1 T*  (0)  , if  T*  (Ij)  > o for  all  a > 0 


(5.82) 


Furtheraore,  the  connected  policy  tt  • with  threshold  T'(l)  given  by 


T’(i)  - lia  T 


a+o» 


i e (0,^1 


(5.83) 


satisfies 


*'  (x,ir' ) - lia  sup  (1  - an)  Va  (x,n*  ) 

n n 


(5.84) 


for  all  x c X. 


Proof 


Since  Lenma  5.4  showed  the  existence  of  an  a*  e (0.1)  such  that, 
for  all  a e (a*,l),  TJ(i)  is  bounded,  then  there  exists  an  increas- 
ing sequence  (a },  a e (0,1),  such  that  a +1  and  either  T*  (I,)  - O’ 
n n n a l 

n 

for  all  n or  I*  (I.)  >0  for  all  n with  the  lia  T*  (0)  well  defined, 
ci  i “ a 

n n 

Thus,  it  is  deer  that  T'(*)  defined  by  (5.84)  exists,  and  the 

connected  policy  »'  with  threshold  T' (1)  is  well  specified. 

Consider  the  case  of  TJ  (1^)  >_  0 for  all  n > 0.  Then  T'(i)  is 

n 

given  by 


) 


T'(i)  - lia  T*  (0) 
n 


(5.85) 


Using  (5.69)  for  x e (J^  x [0,»)), 
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t 


(l-an)Va  (x,n*)  - (M-Dtj 


Cl 

• lim  sup  E I am(w  + t + pD  )|xrt  ■ x}(l-a  ) 

m a_  a_  0 n 


(5.86) 


o m-0 
n 


where  x is  the  connected  policy  with  threshold  function  T (i)  - T*(0) 
a a a 

for  i e [0,1^]  and  is  given  by  (5.65).  In  a similar  manner  to 
(5.71)#  it  is  clear  that  for  x e (R^  x [0,“)) 


*'(x,x')  - (M-l)t.  - lim  sup  N_i  E ,{  l (w  + t + uD  )|x.  - x)  . 

1 n-0  n an  an  U 

(5.87) 


Since  the  right  side  of  (5.86)  represents  the  a-optimal  cost  for  the 
single  channel  case  which  by  Lemma  3.11  converges  to  the  right  side 
of  (5.87)  as  n + then  for  x e (R^  x [0,°°)) 


♦'(x.n1)  - lim  sup  (1  - ur)  Vq  (xrx*  ) . I 

n+«  n n 

Now  it  is  easily  shown  that  ♦,(x,x')  is  a constant  for  all  x e X. 
Hence  using  Lemma  5.5  and  (5.88)  for  all  x e X 


|*'(x,x')  - lim  sup  (1  - a ) Vo  (x,x*  )| 
n*»  n n 

<_  | ♦' ((0,I^,0) ,x’)  - lim  sup  (1  - «n)  Vq  ([0,^,0), x*  )| 


+ lim  sup  (1  - a ) | V (x,x*  ) - V ({0,1  ,0),x*  )j 


n ' a ' ’ a ' 
n n 


Now  consider  the  case  of  T*  (I,)  *0  for  all  n > 0.  Then 

a l 
n 

T'(i)  - 0”  for  i c [0,1^].  Clearly  from  (5.75)  for  x c (r3  x [0,“)) 


f 


3 


where  is  given  by 
hold  T'(i), 

*'(x,ir')  - 


It  is  clear  that  ♦'(x,*1)  is  a constant  for  all  x e X.  So,  using 
Leasui  5.5  and  (5.90),  in  a manner  similar  to  (5.89),  it  is  found  that 
for  x c X 

*'(x,ir')  - lim  sup  (1  - on)  Va  (x,ir*  ) . (5.91) 

n+“  n n 

Q.E.D 

Finally,  using  Theorem  3.2,  the  average  optimal  policy  is 
established. 

he—  5.7 

The  policy  «'  which  satisfies  for  all  x c X 

♦ *(x)  - ♦,(x,ir*)  - *,((0,I1,0),x')  (5.92) 

is  given  by  the  policy  described  in  Lemma  5.6. 


(5.72)  and  for  n*  a connected  policy  with  thres- 


lim  sup  (1  - a ) Va  (x,ir') 
n 

lim  sup  (1  - a ) V (x,ir*  ) . (5.90) 

u oi  Q 

n+®  n n 
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) 


3 


3 


J 


I 


3 


3 


3 


3 


Proof 


Since  by  Lenas  5.2,  5.5,  and  5.6,  Assumptions  3.1,  3.2,  and 
3.3  are  valid,  then  the  results  stated  in  the  theorem  follow  by 
Theorem  3.2. 

Q.E.D 

Now  to  verify  that  it'  is  also  a policy  ir*  which  satisfies  (5.38), 
it  is  required  to  show,  as  was  discussed  earlier,  that  for  all  x e X 
*'(x)  is  equal  to  $(x,ir').  The  following  theorem  yields  the  required 
result . 

Theorem  5.1 

Let  the  average  cost  #(x)  be  given  by  (5.34)  and  let  *'(x)  be 
given  by  (5.92)  where  the  policy  it*  given  in  Lenaa  5.6  achieves 
#'(x).  Then 

♦ (x)  - $'(x)  - *’(((>, Ilt0))  . (5.93) 

Furthermore,  the  policy  ir'  satisfies  (5.38)  and  is  the  average  optimal 
policy  for  the  Markov  decision  problem  with  costs  c(x,a)  given  by 
(5.32). 

Proof 

Flrat,  it  is  required  to  prove  that,  for  all  x e X,  ♦(x,ir’)  equals 

*'(x,v').  Generalizing  Lenaa  3.10,  it  is  clear  that  under  n'  the 

distribution  of  x » (w  ,i  ,r  ) has  a limiting. probability  distribution 
n n n n 

neglecting  T , given  by 


3 


F(W,I)  - lim  Pr(wn  < W,  in  < l|X()  - x)  (5.94) 

n^° 

which  Is  independent  of  x e X.  Furthermore,  lim  E(w  |x.  ■ x)  exists, 

n-K»  n • 

is  finite  and  is  also  Independent  of  x e X.  So,  *(x,n')  and  #'(x,ir') 
are  given  in  terms  of  F(W, I)  by 

*(x,ir’)  - J c((w,i) ,a(w>)  dF(w,I)  (5.95) 

we[0,«) 

ietO.Ij,] 

and  * 

♦'(x,n')  - | c'((w,i),a(w))  dF(w,i)  X5.96) 

we[0,“) 

icIO,^] 

where  a(w)  is  the  decision  function  given  by 


(1  , if  w£X'(i) 

2 , if  w > T'(i)  , (5.97) 


and  T'(i)  is  given  by  (5.83)  and  is  a constant  Independent  of  1. 

Now  consider  the  case  where  the  threshold  T'(i)  associated  with 
**  is  greater  than  or  equal  to  zero.  Then  it  is  clear  that  no  state 
in  the  region  x (0,~)  communicates  with  a state  in  the  region 
Rj  * I0,«»)  where  R^  and  R^  are  given  by  (5.65)  and  (5.66),  respective- 
ly. Since  from  (5.94)  the  limiting  probability  distribution  is  in- 
dependent of  the  Initial  state,  then  all  states  in  R^  x [0,»)  are 
transient  states  and 

lim  Pr(xn  c (R2  x [0,-))}  - 0 . (5.98) 

n 
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> 
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3 


3 


t 


I 


c 


I t 


c 


c 


c 


But  for  all  (w,i)  e 

c((w,i),a(w))  - c'((w,i),a(w))  . (5.99) 

Therefore,  using  (5.98)  and  (5.99)  in  (5.95)  and  (5.97)  results  in 

*(x,n')  - #'  (x,ir' ) - *((0,1^0))  (5.100) 

for  all  x e X. 

For  the  case  where  the  threshold  T'(i)  associated  with  it'  is 
0 , it  is  clear  that  no  state  in  the  region  ([0,”)  x [1^]  x [0,®)) 
communicates  with  a state  in  the  region  ([0,®)  x [0,1^)  x [0,®)). 

Thus,  in  a similar  manner  to  the  case  of  T'(i)  >_  0, 

lim  Pr{x  e ([0,®)  x [0,1.)  x [0,®))}  - 0 (5.101) 

n-*-®  n 1 

and  for  all  (w,i)  e ([0,®)  x [1^]) 

c((w,i) ,a(w))  - c’((w,i),a(w))  . (5.102) 

Hence  substituting  (5.101)  and  (5.102)  into  (5.95)  and  (5.96),  the 
following  result  is  obtained: 

*(x,ir')  - *(x,n')  - *((0,^,0))  (5.103) 

t 

for  all  x c X. 

Now  from  (5.42)  for  all  x e X,  $*(x)  #(x),  and  from  (5.100) 

and  (5.103)  for  all  x e X, 

*'(x)  - ♦,(x,ir')  - *(x,ir')  . (5.104) 

* 
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Therefore,  for  all  x e X 


$(x)  - *(x,ir')  - *(0,1^0)  . (5.105) 

Q.E.D 

Thus,  from  (5.37)  it  is  clear  that  n'  given  in  Lenmia  5.6  satisfies 
(5.3)  and  so, 

L2({Dk})  " Ln,({Dk})  * (5.106) 

Hence,  equation  (3.16)  is  a valid  expression  for  the  delay  distortion 
M 

relation  y^CD).  Since  it'  is  a connected  policy  with  threshold 
T’(i)  ■ T*  applied  to  the  waiting  time,  then  it  is  clear  that 

L2({D » - min  L2({D }) 
k nell  n k 

- min  L2  ((D))  (5.107) 

Tc[  f0,ao)U(0~)  ] nT  K 

where  is  the  connected  policy  with  threshold  T.  In  evaluating 
Lff  ({D^}),  the  case  of  a threshold  of  0 corresponds  to  the  case  of 

"t 

" D2  and  a policy  with  an  arbitrary  positive  threshold.  Thus, 
substituting  (5.107)  Into  (3.16)  results  in 

y!?(D*)  " min  (L2  ((D.  }))  - pD*  (5.108) 

2 {D^O.k-1,2}  *T  k 

{T>0} 

2 

where  {D*}  and  T*  minimize  ({D^})  and  D*  is  the  average  distortion 

T 

M 

associated  with  (D*)  and  T*.  Hence,  y2(D)  for  the  tandem  channel  case 
is  computed  by  finding  the  minimum  of  a functional  in  much  the  same 
manner  as  for  the  single  channel  case  described  in  Section  4.1.  The 
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f ' 
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2 

functional  L ({D,  }),  T ^ 0,  is  readily  evaluated  in  terms  of  express- 
^ * 

ions  derived  in  Section  4.3,  and  the  following  lemma  describes  this 
relationship. 

Lemma  5.8 

For  >_  and  T ^ 0, 

,00 

L*  ((Dk>)  - j W dF(W)  + (tx  + liDj^)  F(T)  + (t2  + pD2>  (1  - F(T)) 

+ (M~l)t1  (5.109) 

where  t^  is  given  by  (5.5)  and  F(W)  Is  the  probability  distribution 
given  in  Theorem  4.2. 

Proof 

The  expressions  (5.95),  (5.96)  and  (5.99)  used  in  the  proof  of 
Theorem  5.1  are  all  valid  for  any  connected  policy  with  a non-negative 
threshold.  Also,  for  the  case  of  a positive  threshold,  it  is  clear 
that  c((0,i) ,a(0))  in  the  integral  in  (5.95)  is  a constant  independent 
of  i.  So, 

L^({Dk})  - ♦((0,I1,0),nT) 

• | c((w,0) ,a(w))  dF(w)  (5.110) 

we[0,«) 

where  F(W),  which  is  given  in  Theorem  4.2,  is  the  limiting  distribution 
of  the  waiting  time  in  the  buffer  associated  with  the  first  channel. 
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Now  substituting  the  value  of  c((w,0),a(w))  into  (5.110)  the  express- 
ion (5.109)  results. 

Q.E.D 

M 

Thus,  the  delay  distortion  relationship  Y2(°)  for  a tandem  channel 
network  is  calculated  in  the  same  manner  which  was  used  for  calculating 
the  delay  distortion  relationship  for  a single  channel  network. 

5.2  Properties  of  y!^0) 

M 

The  delay  distrotion  relationship  y2(D)  for  the  tandem  channel 

network  defined  by  (3.13)  has  the  properties  which  are  described  in 
1 M 

Section  4.2  for  Y2(°)»  That  is,  Y2(°)  ia  a nonincreasing  function 

2 

of  distortion  D,  and  defined  in  (4.10)  is  given  by 

Dmi»  " f"1<X"1  c " »2>  • C5.111) 

and  is  independent  of  the  number  of  channels. 

An  additional  property  which  is  of  interest  is  associated  with 
M 

the  behavior  of  Y2^)  aa  a function  of  the  number  channels  M for  a 
fixed  0.  The  following  theorem  indicates  that  the  advantage  of  using 
an  adaptive  data  compression  decays  as  the  number  of  channels  in 
tandem  increases. 

Theorem  5.2 

For  the  source  and  tandem  channel  network  described  in  Section  3.1, 

2 

let  C“  C for  all  m and  let  D > D . . Then 

m min 
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p2  C"1  + (yJ(D)  - y>)  <_  (Y-(D)  - Y2~1(D))  (5.112) 

where  p2  is  the  associated  protocol  information. 

Proof 

Suppose  that  policy  it  and  distortion  values  and  D^,  ^ D2* 

M 

achieve  YjO3)*  From  (5.108)  it  is  clear  that  tr  must  be  a connected 

2 

policy  and  D < D.  Let  T ({ D,  })u  . be  the  delay  through  a tandem 
L 7T  R n*l 

network  of  M-l  channels  when  policy  it  Is  used  with  data  compressors 
with  associated  distortion  levels  of  and  D^.  From  the  expression 
for  t^((D^})m  given  by  (3.11),  it  is  clear  that 

Y*(D)  - T^((Dk})M_1  - (f(Dx)  + P2)C-1.  (5.113) 

Since  the  policy  n together  with  distortion  values  and  D2  achieve 
an  average  distortion  less  than  or  equal  to  D,  then 

TJ({Dk})M-l  - Y2"1(D)  * (5*114) 

Thus, 

Y2(D)  ' y2~1(D)  - [?(D)  + P21C_1  (5.115) 

where  ?(D)  is  assumed  to  be  a nonincreasing  function  of  D. 

Now  from  (3.10), 

Y>  - yJ_1(D)  - r(D)  C'1  . (5.116) 

So,  using  (5.115)  and  (5.116),  one  obtains 
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OrJ_1(D)  - y^”1(D)}  - CyJ(D)  - Y2<I»>  1 p2  c”1  • (5.117) 

Q.E.D 

The  implication  of  Theorem  5.2  Is  that  for  a fixed  distortion 
level  as  the  nuafcer  of  channels  In  tandem  Increases,  the  nonadaptlve 
data  compression  scheme  approaches  and  then  surpasses  the  performance 
of  the  adaptive  scheme.  This  property  is  exhibited  in  the  following 
section  with  numerical  results. 

5.3  Miner leal  Results 

To  Illustrate  the  degradation  in  performance  as  the  number  of 
channels  in  tandem  increases  of  an  adaptive  data  compression  scheme 
when  compared  with  the  nonadaptlve  data  compression  scheme,  the 
relevent  delay  distortion  relationships  are  evaluated.  The  communi- 
cation system  to  be  examined  uses  the  same  models  as  were  used  in 
Section  4.5.  However,  the  channel  considered  consists  of  M channels 
in  tandem  with  all  channels  transmitting  information  bits  at  rate 

ctSSEiJ. 

sec. 

w 

The  delay  distortion  relationship  y^(D)>  for  the  nonadaptlve 
H 

scheme,  and  y2  (D) , for  the  adaptive  scheme,  are  evaluated  using 
(3.10)  and  (5.110).  The  quantity  y2(D)  is  computed  following  the 
procedures  ussd  in  Section  4.5  to  compute  Y20>).  Figure  5.1  presents 
the  difference  relationships  y^(D)  - y20>)  for  various  numbers  of 
channels  in  tandem,  M.  Figure  5.2  presents  the  same  difference 
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relationships  presented  in  Figure  S.l  with  the  effects  of  protocol 


U 


information  eliminated  by  setting  the  protocol  information  p^  equal 
to  zero. 

From  the  figures  it  is  evident  that  as  M increases  the  gains 
derived  from  using  the  adaptive  compression  schemes  are  lost.  Also, 
the  figures  indicate  that  the  effects  of  protocol  information  on  the 
relative  performance  of  the  adaptive  scheme  are  significant  when 
there  are  a large  ntmiber  of  channels  in  tandem  and  the  distortion 
levels  are  low.  Hence,  adaptive  data  compression  schemes  lose  their 
performance  advantage  as  the  nuaber  of  channels  in  tandem  Increases 
as  was  shown  in  Theorem  5.2.  This  is  expected  since  as  the  number  of 
channels  in  tandem  grows  the  transmission  delay  prevails  over  the 
queueing  delays. 

5.4  Conclusions 

The  delay  distortion  relationship  for  a communication  system 
employing  a tandem  channel  network  and  an  adaptive  data  compression 
scheme  was  considered  in  this  chapter.  The  optimal  structure  of  the 
adaptive  data  compression  scheme  which  achieves  this  relationship 
was  shown  to  be  identical  to  the  optimal  structure  for  a single 
channel  network.  Using  the  optimal  structure,  the  delay  distortion 
relationship  was  shown  to  be  specified  by  a minimization  of  a func- 
tional over  a vector  space.  This  functional  was  shown  to  be  readily 
evaluated  using  the  results  of  Chapter  IV.  Subsequently , properties 
of  the  relationship  were  reviewed  and  examples  of  the  delay  distortion 
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relationship  were  presented.  The  examples  demonstrate  the  rate  at 
which  the  performance  advantage  of  the  adaptive  data  compression 
versus  the  nan-adaptive  scheme  Is  lost  as  the  number  of  channels 

tandem  Increases. 
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CHAPTER  VI 


CONCLUSION  AND  SUGGESTIONS  FOR  FUTURE  RESEARCH 
6.1  Suanary  and  Conclusion 

In  this  dissertation  the  coding  of  sources  under  a fidelity 
criteria  was  examined  for  a class  of  sources  which  emit  randomly 
occurring  messages.  Such  a class  of  sources  is  employed  to  model 
information  carrying  processes  entering  a coma  unicat ion  network.  They 
differ  from  the  normal  models  of  sources  found  in  information  theore- 
tical studies,  since  the  latter  are  generally  assumed  to  emit  messages 
on  a regular  temproal  basis.  For  such  a class  of  network  sources,  the 
rate  distortion  function,  R(D),  was  evaluated,  and  source  coding  end 
converse  source  coding  theorems  were  proved.  From  these  theorems 
a new  operational  definition  of  R(D)  in  terms  of  message  queueing 
delays,  and  transmission  delays  was  determined.  This  operational 
definition  is  observed  to  constitute  a natural  extension  of  the 
usual  operational  definition  of  R(D).  Furthermore,  it  relates  R(D) 
to  the  message  delay  in  the  network,  which  is  an  Important  performance 
measure  in  the  evaluation  of  communication  networks. 

Then  for  this  class  of  sources  an  adaptive  data  compression  scheme 
was  presented  and  the  delay  distortion  relationship,  which  forms  the 
trada-off  between  massage  delay  In  the  communication  network  and  the 
distortion  level,  was  studied.  This  adaptive  data  compression  scheme 
utilizes  observations  of  the  network  congestion  to  determine  the  amount 
of  compression  a massage  receives,  with  the  objective  of  minimising  the 


) 


message  delay  for  a given  distortion  level.  Using  results  from  Markov 
decision  theory  and  extensions,  which  were  derived  in  the  dissertation, 
of  Markov  decision  theory  to  nondenumerable  state  spaces  and  un- 
bounded costs  functions,  the  structure  of  the  optimal  adaptive  data 
compression  scheme  was  determined  for  tandem  channel  networks. 

Following  the  establishment  of  the  structure  of  the  optimal  scheme, 
a queueing  analysis  was  performed  for  the  communication  system  which 
utilizes  a tandem  channel  network.  From  this  analysis  the  delay 
distortion  relationship  was  shown  to  be  expressible  in  terms  of  a 
minimization  of  a functional  over  a vector  space.  Numerical  results 
were  presented  to  demonstrate  that  for  low  distortion  levels  the 
adaptive  data  compression  scheme  Is  an  effective  means  of  reducing 
the  message  delay  obtained  by  nonadaptive  schemes.  Furthermore, 
numerical  results  have  demonstrated  that  as  the  number  of  channels 
in  tandem  increased  the  advantages  of  using  the  adaptive  scheme 
versus  the  nonadaptive  scheme  diminished. 

6.2  Suggestions  for  Future  Research 

Several  possible  directions  for  future  work  will  now  be  Indicated. 
In  this  dissertation,  for  a class  of  sources  which  emit  randomly 
occurring  messages,  a source  coding  theorem  was  proved  using  message 
block  codes.  As  was  mentioned  In  Chapter  II,  a source  coding  theorem 
could  be  proved  for  this  class  of  sources  using  block  source  coding 
procedures  which  smploy  the  encoding  of  a block  of  duration  T of  the 
realisation  of  the  source.  Investigation  of  message  block  codes 
end  time  block  codes,  in  terns  of  performance  and  complexity,  would 
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be  worthwhile.  Furthermore,  extending  the  coding  theorem  for  message 
block  codes  to  convolutional  codes  would  be  of  Interest  as  well. 

Related  to  adaptive  data  compression  schemes,  further  studies 
are  needed  to  extend  the  results  presented  in  this  dissertation 
to  more  complex  data  compression  schemes  and  more  complex  networks. 

One  such  problem  is  the  determination  of  the  optimal  structure  of 
the  decision  policy  for  adaptive  data  compression  schemes  employing 
more  than  two  data  compressors. 

Furthermore,  in  this  dissertation,  novel  techniques  were  de- 
rived, based  on  Markov  decision  theory,  to  answer  questions  concerning 
the  existence  of  optimal  decision  policies.  These  techniques  seem 
to  be  applicable  to  other  queueing  systems  employing  adaptive  con- 
trols such  as  the  multitude  of  flow  control  procedures  in  communi- 
cation networks.  Investigation  of  such  applications  employing  the 
techniques  derived  here  would  be  Important  and  highly  rewarding. 


REFERENCES 


!•  Shannon,  C,  E.  "A  Mathematical  Theory  of  Communication,"  Bell 
Systema  Technical  Journal.  Vol.  27,  pp.  379-423,  623-656,~JuTy, 
October  1948. 

2.  Shannon,  C.  E.  "Coding  Theorems  for  a Discrete  Source  with  a 
Fidelity  Criterion,"  ire  National  Convention  Record.  Part  4. 
pp.  142-163,  1959. 

3.  Gallager,  R.  G.  Information  Theory  and  Reliable  Communications. 
John  Wiley,  New  York,  1968. 

4.  Berger,  T.  Rate  Distortion  Theory:  A Mathematical  Basis  for 

Data  Compression.  Prentice-Hall,  Englewood  Cliffs,  New  Jersey, 
1971. 

5.  Kirch,  S.  I.  and  T.  Berger.  "Coding  for  Delay-Dependent  Fidelity 
Criterion,"  IEEE  Trans,  on  Information  Theory.  Vol.  IT-20,  pp.  77- 
85,  January  1974. 

6.  Davies,  D.  W.  and  D.  L.  A.  Barber.  Communication  Networks  for 
Computers,  John  Wiley,  London,  1973. 

7.  Abramson,  N.  and  F.  F.  Kuo,  ed.  Computer-Communication  Networks. 
Prentice-Hall,  Englewood  Cliffs,  New  Jersey,  1973. 

8.  Green,  P.  E.  and  R.  W.  Lucky,  ed.  Computer  Communications.  IEEE 
Press,  New  York,  1975. 

9.  Chu,  W.  W. , ed.  Advances  In  Computer  Communications.  Artech  House, 
Dedham,  Massachusetts,  1974. 

10.  Kleinrock,  L.  Communication  Nets;  Stochastic  Message  Flow  and 
Delay.  McGraw-Hill,  New  York,  1964. 

11.  Kleinrock,  L.  Queueing  Systems.  Vol.  2,  John  Wiley,  New  York, 

1976. 

12.  Rubin,  I.  "Communication  Networks:  Message  Path  Delays,"  IERE 

Trans,  on  Information  Theory.  Vol.  IT-20,  pp.  738-45,  November 
1974. 

13.  Rubin,  I.  "Data  Compression  for  Communication  Networks:  The 

Delay-Distortion  Function,"  IEEE  Trans,  on  Information  Theory. 

Vol.  IT-22,  November  1976. 

14.  Blackwell,  D.  "Discrete  Dynamic  Programming,"  The  Annals  of 
Mathematical  Statistics.  Vol.  33,  pp.  719-726,  1962. 


190 


* 


I 


c 


£ 


l 


t 


t 


C 


£ 


15.  Howard,  R.  A.  Dynanlc  Programing  and  Markov  Processes , John  Wiley, 
New  York,  1960. 

16.  German,  C.  "On  Sequential  Decisions  and  Markov  Chains,"  Management 
Science , pp.  16-24,  October  1962. 

17.  Derman,  C.  "Denumerable  State  Markov  Decision  Processes  - 
Average  Cost  Criterion,"  The  Annals  of  Mathematical  Statistics, 

Vol.  37,  pp.  1545-1554,  1966. 

18.  Ross,  S.  M.  "Non-Discounted  Denumerable  Markovian  Decision  Models," 
The  Annals  of  Mathematicsl  Statistics.  Vol.  39,  pp.  412-423,  1968. 

19.  Ross,  S.  A.  "Arbitrary  State  Markovian  Decision  Processes,"  The 
Annals  of  Mathematical  Statistics.  Vol.  39,  pp.  2118-2122,  1968. 

20.  Lippman,  S.  A.  "Semi-Markov  Decision  Processes  with  Unbounded 
Rewards,"  Management  Science.  Vol.  19,  pp.  717-731,  March  1973. 

21.  Lippman,  S.  A.  "On  Dynamic  Programming  with  Unbounded  Rewards," 
Management  Science.  Vol.  21,  pp.  1225-1233,  July  1975. 

22.  Reed,  F.  C.  "Denumerable  State  Markov  Decision  Processes  with 
Unbounded  Costs,"  Department  of  Operations  Research  and  Department 
of  Statistics,  Stanford  University,  Stanford,  California,  Techni- 
cal Report  No.  157,  November  1973. 

23.  Harrison,  J.  M.  "Discrete  Dynamic  Programming  with  Unbounded 
Rewards,"  The  Annals  of  Mathematical  Statistics,  Vol.  43,  pp.  636- 
644,  1972. 

24.  Awate,  P.  "Dynamic  Programming  with  Negative  Rewards  and  Average 
Reward  Criterion,"  Department  of  Operations  Research,  College 

of  Engineering,  Cornell  University,  Ithaca,  New  York,  Technical 
Report  No.  251,  May  1975. 

25.  Crabill,  T.  B.,  D.  Gross  and  M.  J.  Magazine.  "A  Survey  of  Research 
on  Optimal  Design  and  Control  of  Queues,"  Institute  for  Management 
Science  and  Engineering,  School  of  Engineering  and  Applied  Science, 
The  George  Washington  University,  Washington,  D.C.,  Serial  T-280, 
June  1973. 

26.  Sobel,  M.  J.  "Optimal  Control  of  Queues,"  Mathematical  Methods  in 
Queueing,  Western  Michigan  University,  Kalamazoo,  Michigan, 

May  10-12,  1973,  Spring-Verlag,  Berlin,  pp.  231-261,  1974. 

27.  Stidham,  S.  and  N.  U.  Prabhu.  "Optimal  Control  of  Queueing 
Systems,"  Mathematical  Methods  in  Queueing.  Western  Michigan 
University,  Kalamazoo,  Michigan,  May  10-12,  1973,  Springer-Verlag, 
Berlin,  pp.  264-294,  1974. 


191 


28.  Shaw,  L.  "Busy  Period  Control  of  Queues  Based  on  Waiting  Times 
at  Arrivals,"  Operations  Research,  Vol.  24,  pp.  543-563,  May- 
June  1976. 

29.  Rubin,  I.  "Information  Rates  for  Poisson  Sequences,"  IEEE  Trans. 
on  Information  Theory.  Vol.  IT-19,  pp.  283-294,  May  1973. 

30.  Smith,  W.  "Renewal  Theory  and  Its  Ramifications,"  Journal  of 
the  Royal  Statistical  Society,  Series  B,  Vol.  20,  pp.  243-302, 
1958. 

31.  Rudin,  W.  Real  and  Complex  Analysis.  McGraw-Hill,  New  York,  1966. 

32.  Kingman,  J.  F.  C.  "Some  Inequalities  for  the  Queue  GI/G/1," 
Biomet rika.  Vol.  49,  pp.  315-324,  1962. 

33.  Lindley,  D.  V.  "The  Theory  of  Queues  with  a Single  Server," 
Proceedings  of  the  Cambridge  Philosophical  Society,  Vol.  48, 
pp.  277-289,  1952. 

34.  Klnchlne,  A.  Y.  Mathematical  Methods  in  the  Theory  of  Queueing. 
Hafner,  New  York,  1969. 

35.  Wldder,  D.  V.  The  Laplace  Transform.  Princeton  University  Press, 
Princeton,  New  Jersey,  1941. 

36.  Cohen,  J.  W.  The  Single  Server  Queue,  North-Hollai.d,  Amsterdam, 
1969. 

37.  Royden,  H.  L.  Real  Analysis.  MacMillan,  New  York,  1968. 

38.  Doob,  J.  B.  Stochastic  Processes.  John  Wiley,  New  York,  1953. 

39.  Chung,  K.  L.  Markov  Chains  with  Stationary  Transition  Prob- 
abilities. Sprlnger-Verlag,  New  York,  1967. 


192 


APPENDIX 


» 


I 


■X 


c 


c 


c 


( 


SOLUTION  TO  INTEGRAL  EQUATION 


The  integral  equation  (4.24)  introduced  in  Section  4.3  is 
considered  in  this  Appendix.  In  Section  A.l,  the  general  form  of 
the  solution  to  integral  equations  of  this  type  is  found.  In 
Section  A. 2,  the  specific  forms  related  to  the  Integral  equation 
(4.24)  are  then  developed.  Finally  in  Section  A. 3,  these  specific 
forms  are  used  in  the  calculation  of  terms  required  in  the  evaluation 
of  the  mean  waiting  time. 


A.l  General  Solution  to  the  Integral  Equation 

The  Integral  equation  (4.24)  can  always  be  written  in  the 
following  form: 


N rx-c 

G(x)  * I ai  *1(x)  - b G(y)  dy  , x ^ T (A.l) 

i-1  •'  -® 

where 

G(x)  - 0 , x < T , (A. 2) 


z^(x)  are  integrable  functions,  and  a^,  b,  c,  and  T are  real  valued 
numbers.  Now  consider  Integral  equations  of  the  form 

rx-c 

Gj(x)  - z±(x)  - b j G1(y)  dy  , x ^ T (A. 3) 

» mOO 

where 


G^x)  - 0 , x < T 


(A. 4) 
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for  i - 2,  2,  ...»  N.  Clearly,  by  substitution  into  (A.l) 


n 

G(x)  - l a G.(x) 
i-1  1 


(A.  5) 


Hence,  the  solution  to  Integral  equation  (A. 3)  needs  to  be  investigated. 
The  following  lemma  provides  a solution  to  this  integral  equation. 


Let  G(x)  satisfy 


fXfC 

G(x)  - z (x)  - b G(y)  dy 

J mmCO 


x > T 


(A. 6) 


where 


G(x)  - 0 


, x < T , 


(A.  7) 


z(x)  is  an  lntegrable  function  and  b,  c,  and  T are  real  valued  num- 


bers. Then 


[(x-T)/c] 

G(x)  - I d Ax) 
r-0 


(A.  8) 


where  [x]  is  the  largest  integer  less  than  or  equal  to  x,  and  dp(x) 
is  given  by 


ao(x)  " | 


z(x)  , if  x T 

0 , if  T < x 


(A. 9) 


and  for  r - 1,  2,  ... 


dr(x) 


fx-c 

- b J dy  , if 


x > T + rc 


, if  x < T + rc  . (A.  10) 


Proof 

For  T <_  x < T + c,  clearly 

G(x)  - z(x)  - dQ(x)  (A. 11) 

satisfies  (A. 6).  Let  x > T and  assume  for  all  x < x,  (A. 8)  is  the 
solution  of  (A. 6).  Then  substituting  (A. 8)  into  (A. 6),  G(x)  is  given  by 

r*-c  ( (y-T)/c] 

G(x)  - z($)  - b l d (y)  dy  . (A.12) 

JT  r-0  r 

So,  interchanging  order  of  integration  and  susmation  in  (A.12), 

I(x-T-c)/c]  rx-c 

G(x)  - z(x)  + l (-  b d (y)  dy)  . (A. 13) 

r-0  h r 

Using  (A. 9)  and  (A. 10)  in  (A. 13),  it  is  clear  that 

( (x-T)/c] 

G(5)  - l d (x)  . (A.  14) 

r-0  r 

Hence,  by  induction  on  x the  result  (A. 8)  is  proved. 

Q.E.D 

A. 2 Specific  Solution  to  the  Integral  Equation 

In  the  solution  of  (4.24),  three  integral  equations  of  the  form 
(A. 6)  need  to  be  solved.  The  following  lemmas  solve  these  equations 
using  the  results  of  Lemma  A.l. 


r 


T 

where  Gt  (x)  ■ 0 for  x < T and  i»k,  t^,  and  T are  positive  numbers. 


Then 


G*  (x)  - 

zk 


[(x-T)/t.  ](-p  (x  - rt,))r 

l * j s , if  x > T 

r-0  rl 


, if  x < T . (A. 16) 


Proof 


Let  dy(x)  be  as  in  (A.9)  and  (A. 10)  with  z(x)  - 1.  Then 


( 1 , if  x > 

iQ(x)  -J 

l 0 , if  x < 


(A. 17) 


Assume 


df(x)  - 


(~Mk(x  - rtk))r 


r! 


, if  x T + rt 
, if  x < T + rt. 


k 


(A. 18) 


Then  from  (A. 10) 


Wx) " 


r-t  rtk)>r 

Jt 


r! 


- dy,  if  x >_  T + (r+l)c 

, if  x < T + (r+l)c  . 

(A. 19) 


Performing  the  integration  in  (A.19)  results  in 

(-Wk(x  - (r+l)tk))rfl 


dr+l(K)  " 


(r+1) ! 


, if  x > T + (r+l)c 
, if  x < T + (r+l)c  . 

(A.  20) 
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r 


f. 


G 


Hence,  by  Induction  for  r >_  0 

W*  - rtk»r 


r! 


dr(x)  H 


and  applying  Lena  A.l, 

„T 


, if  x >_  T + rc 

, if  x < T + rc 

(A. 21) 

k1 

k dr(x) 

(A. 22) 

1 of  (A. 21)  into  (A. 22). 

Q.E.D 


A. 3 


o 


Let  Gn(x)  satisfy 

u 


;(x)  - G°  (x)  - u2  f^2  G^(y)  dy 
1 ' T 


x > T 


(A. 23) 


where  G*(x)  - 0 for  x < T,  G®  (x)  is  given  by  (A.16)  and  p2>  t2  and  T 
are  positive  numbers.  Then 


T/_\  _ ^l1  At 


Gj(x) 


r-0 


G;(x) 


(A. 24) 


0 


where 


,[(x-rt1)/t2]  (-p^*  (-P2)V  (x-rtj-vt2) 

i 0 


<r+v)l 


, if  rt  > T 


g!(x) 


,I(X“TT)/t2\  , .v  5 

l (“W.)  (-P-)  l 

v-0  u"0 


(x-X-vt2)U+V  (T  - u^)™ 


(u+v) 1 


(r-u)l 


, if  rtj  < T .(A* 25) 


1*7 


Proof 


Substituting  (A. 16)  for  Gu  (x) 


GG<*> 


tX(V  <-«!<«  - rt1))r 

" rio  rl 


in  (A. 23),  G*(x) 
fX~^2  T 

JT  °GW  dy 


Is  given  by 
, x >_  T.  (A. 26) 


Clssrly,  (A. 26)  is  in  the  fora  of  (A.l)  whose  solution  is  given  by 
(A. 5)  in  the  fora 


[x/tj] 


GG(x)  “ l <£<*> 


r-0 


(A. 27) 


where  6*(x)  satisfies 


(-P (x  - rt,))r  /-x-t 


,T  v-mx  - rt.jj  fx- 1, 

! (x) — i w 6*<y)  dy  , x > 

' Z >aax(T,rtl)  r 


rt. 


1* 
(A. 28) 


Consider  two  cases,  T >_ rtx  and  rtx  < T.  For  the  first  case  of 

T > rt1#  let  d*(x)  be  as  in  (A.9)  and  (A. 10)  with 

, (-P,(x  - rt-))r 

*<*> — „ 1 . 


(A. 29) 


Then 


dj(x)  - 


(-^(x  - rtj))r 


rl 


, if 
» if 


x >_  rtj^ 
x < rt. 


(A.  30) 


and  by  repeated  use  of  (A. 10) 


. r+v 


d^(x)  «< 


<-Mj>r  (“P2)V  (*  ~ rtj  - vt2)' 

(75^)1 » if  * i «!  + v4 

* if  x < rtj  + vt2  • (A. 31) 


Applying  Lemma  A.l 

[(x-T)/t,J 

G*<*>  * l <£<*> 


v-0 


l(x-T)/t2]  (.Ui,r  (.Uj)v  (J  . rt]  _ vtj) 

(r+v) ! 


r4v 


I 

v-0 


(A. 32) 


for  T >.  rtj. 


For  the  second  case  of  rtx  < T,  let  d*(x)  be  as  In  (A.9)  and 
(A. 10)  with 


- (-U  (x  - rt.))r 

• W - — ■ rl  1 


(A.  33) 


o 


0 


Using  the  binomial  expansion,  z (x)  Is  given  by 

2,  . r I/r..  " «,) 


So, 


'(x)  - (w  )r  f fa  - vY  - rv 
1 u=oV  uI  / <**>'  * 

4t  f (x  - T)U  <T  " ’‘V 

■wi)  ..L  u! (7^57"  * lf  x i 


dQ(x)  -< 


and  by  repeated  use  of  (A. 10) 


(A. 34) 


, If  x < T (A.  35) 


<r00  “ (-Mj) 


r,  I f'-T-'<r\«  - «!>”* 

(~“z>  io\  • 


(A. 36) 
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A* 


_ 


r 


Applying  Leu  A.l  to  (A.28)t 


l(*-T)/t2] 

“ I ^(*) 

v-0  V 
K*-T)/t2] 


- r r (x  - T - vt2)^  (T  - ut  )r_u 

l <-P1)r  (-w2)v  l 1 1 


v-0 


u-0  (lHv)  1 


for  rt^  < T. 


(r-u) ! 


(A. 37) 


Q.E.D 


A.4 


Lot  G^(x)  satisfy 

-Xx  fX~l9 

Ge(*)  ■ « ” P2  J0  1 Ge^  dy  * * i 0 (A. 

whsra  Ce(x)  - 0 for  x < 0,  and  X,  u2  and  t£  are  positive  nuabers. 
Than 


38) 


l*/t2] 


. r-v 


l <-w-)r{(-x) 


G (x)  -- 


rM> 


X)  o 

- 1 ■ 

v-1 

(r-v)! 

(-x)“v) 

» if 

x >.  0 

. if 

x < 0. 

• 

(A.  39) 

Proof 


Lot  dr(x)  bo  glvon  by  (A.9)  and  (A.10)  whore 


«(*)  • • 


-Xx 


(A.  40) 
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. 


I 

si 


Then 


C 


L 


C 


do(x)  " 


e-Xx  , if  x > 0 


, if  x < 0 


and  by  repeated  use  of  (A. 10) 


.r-v 


(A. 41) 


dr(x)  -< 


-X(x-rt-)  n (x-rt_) 

(-U2)r  C(-X)“r  e ’ \ (r-v)t'  <“X)  }*  lf  * 1 rt2 


, if  x < rt2# 
(A. 42) 


Applying  Lemma  A.l,  G#(x)  is  given  by 


[*/t2] 

G (x)  - l d <x) 
r»0 


(A. 43) 


C 


Hence,  upon  substitution  of  (A. 42)  into  (A.43),  (A. 39)  results. 

Q.E.D 


To  insure  the  proper  normalization  of  the  solution  of  (4.24), 

T T 

various  limits  of  the  functions  (x),  Gg(x),  end  Cf(x)  defined  in 
A.2,  A, 3,  and  A.4  need  to  be  determined.  The  following  lemma 


determine  the  limits  by  using  the  final  value  theorem  of  Laplace 
transform  theory. 


O 


A.3 


Let  G (x)  be  given  by  (A.16)  where 
*2 
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J 


-Xt, 


w2  ■ x e 


(A. 44) 


and  Xt2  < 1.  Then 


, ™ XT 

11m  e G (x)  - ■; rr~ 

x~  C2  A Z2 


(A. 45) 


Proof 


...  \ _ m 

Let  6*  (s)  be  the  Laplace  transform  of  e * G (x), 
t2  t2 

(A. 16) 


So,  using 


8*  (.)  - .iT  M £ — T 

*2  r-0  r! 


[x-T/t«] 


2 J (-X (x-rt2-T) )r  X (x-ra2-T) 


u(x-rt2-T)} 


- -X (x-rt-T)  X (x-ra-T) 

• l LH -r? )r  • 2 u(x-rt--T) } 

r-0  rl  * 


(A. 46) 


where  u(x),  (4.83),  Is  the  unit  step  function  at  zero.  Using  the 
relationship 


L{ix~VT  eX(x"A)  U(x  - A)) --*A-  . 

rl  (a  - \)t+1 

In  (A. 46),  results  In 


How,  from  the  final  value  theorem  of  Laplace  transforms. 


(A. 47) 


(A.  48) 


1 


1 


3 


3 


1 


) 


0 


11*  eXx  G*  (x)  - 11a  8 G*  (a) 
2 r+°°  2 


So,  using  (A. 48)  and  1 'Hospital  rule  In  (A. 49), 


, . Xx  „T  , x e 
11a  e G (x)  - z — 

2 1 ~ 


' 1 - Xt 

2 1 AC2 


Lemma  A. 6 


Let  GA(x)  be  given  by  (A. 25)  where 


■ X e 


and  Xt^  < 1.  Then 


T T 

lla  eXx{GG1(x)  - Gg2(x)} 


l^/ti^  T T 

l arl  - L 2) 

r-0  r r 


where 


1/(1  - xt2) 


r (-X(T-rt. ))“  X(T-rt-) 

Jo  * 


, If  T < rtx 


, If  rtj^  < T 


and  Tx  < T2. 


Proof 


T T 

Lot  G^*  2(x)  be  given  by 


» 


T T T T 

Gr1  2(x)  - Gr1(x)  - Gr2(x)  (A.5< 

-T  «T1T2 

where  G (x)  Is  given  by  (A.25).  Furthermore,  let  G (s)  be  the 

r T T r 

Xx  a1j2 

Laplace  transform  of  e G (x).  Then,  from  (A. 24),  the  Laplace 
T T T 

transform  of  e*x  G^*  2(x)  Is  given  by 

X X oa  X X 

LUXs( G^Cx)  - Gg2(x))}-  l G 1 2(s)  . (A.5S 

r-0 

T T 
Xx  i 2 

Taking  the  Laplace  transform  of  e G^  (x)  and  Interchanging  the 
order  of  summation  and  transformation  in  the  resulting  expression  and 


using  (A. 47),  firX  2(s)  is  given  by 


, if  T2  < rtx 


-.<rtl^2)  X(T  t) 


r , ,xr+v  e 

l <~x>  « 

v-0  ( (s-X)1 


2 


-«,*«,>  (I  . rt  )*-, 

. .»**« 5^Ti i • lf  T1  i rti  ' T" 


u-0  (s-X) 


‘1  - “1  2 


r r+v  5 -<Xrtr*vt2)  j(T  - rt  )r_u  -T  (s-X) 

I <-x>  X 5 — Hteirr — • 

v-0  u-0  (s-X)^r+'*+1  I 'r  U)I 


(T2  - rtj)  -T2(s-X)| 


(r-u)l 


, if  rt2  < T2  , 


(A.  56) 
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I 


Exchanging  Che  order  of  summation  in  (A. 56), 


, if  T2<  rtx 


-art. 


s-A+Ae 


..  -sto  I , ,„r  L 


-T2(s-A) 


8t2  l(s-A)r  u*0  (8-X)U 


T.T- 

Gr1  2 (8) 


(T2  - rt^)r-U  -Art1  j 

(r-u) ! e l * 


-Art, 


if  <_  rtx  < T 


(~A)r  e * r 1 jCTj  ~ rtL)  -T2(s-A) 

8-A+Ae'8t2  --  "u  1 (r'u)l 


u"0 


(8-A)U  I (r-U)’ 


(T2  - rtj)1”0  -T2(8-A) 

(r^uTi  * 


if  rt^  < • 


Now,  from  Che  final  value  theorem  of  Laplace  transforms  and 

T T T T 

lim  eXx(GG1(x)  - GG2(x))  - lim  a L{#Xx(G(,1(x)  - GG2(x))> 

9+0 


r “^1^2 

* lim  l s G 1 2(s) 
e+0  r-0  r 

But  from  (A. 52)  and  l'Hospital  rule, 

aTlT2  1 T1  T2 

lim  6 1 2 (.)  - y-j1-  (L  1 - L 2) 

r*0  * AC2  r r 

where  L*  is  given  by  (A.53).  So,  substituting  (A.59)  into  (A.58) 
it  is  clear  that 

Ax  T1  T2  T1  T2 

lim  eAX(GG1(x)  - Gg2(x))  - J (L 1 - L 2) 

~k“  r-0 
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(A. 57) 
(A. 55), 

(A.58) 

(A.59) 

(A.  60) 


Q.E.D 


A.  7 


Let  Gfl(x)  be  given  by  (A. 39),  where 

-Xt 

P2  - x e 


(A. 61) 


and  Xt2  < 1*  Then 


11m  eXx(G  (x)  - e'XT  G (x  - T))  - - -r  X\ 

6 6 1 ‘ Xt2 


where  T > 0. 


Proof 


From  (A.  39) 


(A. 62) 


eXx(Ge(x)  - e“XT  Ge(x  - T)) 


l*/t2l 


z*  ( r 

- I 1-  l 

r-0  ( v-1 


r (-X (x  - rt2)) 


r-v 


X (x-rt2) 


(r-v) I 


(x-T/t2l 


" j r (-X (x  - rt2  - T)) 

r-0  i 


r-v 


v-1 


(r-v) l 


X (x-rt2-T) | 


I - 

(A.  63) 


X 

Let  G0(a)  be  the  Laplace  transform  of  (A. 63).  Then  by  Interchanging 
the  order  of  susast ion  and  transformation  In  the  expression  for  Gft(s) 

«T 

and  using  relationship  (A.47),  G (s)  Is  given  J>y 


. (e-1)  (1  - e~*T) 

- X * x.‘“’) 


(A. 64) 
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Now,  from  the  final  value  of  theorem  of  Laplace  transforms  and 


l'Hospital  rule, 


lim  eXx(G  (x)  - e_XT  G(x  - T))  - lim  s G*(s) 
e e s+0  6 


A. 3 Terms  in  the  Evaluation  of  the  Mean  Waiting  Time 

In  the  evaluation  of  the  mean  waiting  time  in  Theorem 
is  required  to  evaluate  the  limits  of  specific  functions, 
following  lemnas  provide  the  limits  to  these  functions. 


A.  8 


Let  G^  (x)  be  given  by  (A. 16)  and  let 
t2 

H*  (x)  - ( y d(eXy  G*  (y)) 
C2  * 0 Z2 


with  Xt2  < 1.  Then 


..  „T  , . XT/ T 

11m  H (x)  - e (i-7-T 

C2  V1  Xt2  2(1 


[(  T Xt22  \ 

V1  " Xt2  2(1  - Xt,)2/ 


Proof 

The  Laplace  transform  of  eXx  G^  (x)  which  is  denoted  i 

x T 

given  by  (A.48).  Clearly  from  (A.66)  the  transform  of 


(A.  65) 

Q.E.P 

4.3,  it 
The 


(A.66) 


(A. 67) 


G*  (s)  is 
c2 

(x)  is  given 


207 


a.  68) 


by 

Kl^O,))  --ifj  (.<*(.»  . 

Using  the  final  theorem  of  Laplace  transforms  and  (A. 68), 

11m  H*  (x)  - liw  s L(H*  (x)) 
r+~  c2  r*0  c2 

- - 11m  ~~  [s  G*  (s)J  . (A. 69) 

s-*0  as  C2 

Finally,  substituting  (A. 48)  in  (A.69)  and  evaluating  the  limit  In 
(A. 69),  the  result  (A. 6 7)  is  proved. 

Q.E.P 


A. 9 


Let  G^Cx)  be  given  by  (A. 24)  and  let 

T T rx  T T 

HG1  2(x)  " P y «l(«Xy(GG1< y)  - GG2(y)» 


with  Xtj  < 1.  Then  for  < T2 


T T 

11m  H 1 Z(x)  - ~7—  £ (h  1 - h 2) 

m*-  G 1 Xt2  r-0  r r 


[T2/ti]  _ 


(A.  70) 


(A.  71) 
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where 


2r (1  - Xt2)(l  - Xt1)  + (Xt2) 


2X(1  - Xt2)' 


, if  T < rtl 


hT- 


„ , X(T-rt.) 

I ^(~X(T-rtl))  j [2(l-Xt2)(TX+u-r)  + <Xt2r]  e 1 


2X(1  - Xt2) 


if  rt^  < T 


(A. 72) 


Proof 

t T 

Prom  (A. 55),  the  Laplace  transform  of  eAX(G_A(x)  - G 2(x))  is 
• T I GO 

r *1/2 

denoted  I G (s).  In  a similar  manner  to  (A. 68)  and  (A. 69) 
r-0  r 


T T oo  T T 

UHg1  2(x))  - - i|-  (.  I firJ  2(s))  (A. 73) 


and 

T T oo  T T 

llm  H 1 2(x)  -11.  l.(  l G 1 2(.))]  . (A. 74) 

jr-  G s-*0  d‘  r-0  r 

T.T. 

Then,  subsituting  (A.57)  for  z(s)  in  (A. 74)  and  evaluating  the 
limit,  (A. 71)  is  obtained. 

Q.B.D 
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A.  10 


Let  Ge(x)  be  given  by  (A. 39)  and  let 


^ 2<X>  “ 10  7 d(*AX{Ge(,c  - V - e X<T2  Tl>  Ge(x  - T2)}^  (a. 

with  Xt2  < 1 «nd  Tj  < T2.  Then 


75) 


n.  b^o  . -y> . - v -<»2)2«2  - v 

* 2(1  -Xt2)  (l-u2>  2(1  . ltj)l 


(A. 76) 


Proof 


From  (A. 64),  the  Laplace  tranafora  of  eXx(Ge(x)  - e"AT  Ge(x-T)) 
la  denoted  6e(a).  In  a similar  manner  to  (A. 68)  and  (A. 69) 


UH*lT2(x))--±^{.  e'“Tl  G^2"Tl(s)> 


(A. 77) 


T.T,  — aT  T — T 

i£H*  W {*  * Ge2  1<»)>  . (A.  78) 


Than,  aubatltutlng  (A. 64)  for  G*(.)  in  (A. 78)  and  evaluating  the 
(A. 76)  la  obtained. 

Q.E.D 
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T 


